METHODS AND ORGANISMS FOR
PRODUCTION OF B6 VITAMERS



Related Applications

This application claims priority to U.S. Provisional Application Serial 
No. (Serial No. not yet received) entitled "METHODS AND ORGANISMS FOR 
PRODUCTION OF B6 VITAMERS" (Atty Ref. No. BGI-152-4), filed on March 3, 
2003, U.S. Provisional Application Serial No. 60/367,089 filed on March 22, 2002, U.S. 
Provisional Application Serial No. 60/367,863 filed on March 25, 2002, and U.S.
Provisional Application Serial No. 60/368,618 filed on March 29, 2002. The entire
content of the above-referenced applications is incorporated herein by this reference. 



Background of the Invention 

Vitamin B6, also known as pyridoxine or pyridoxol (PN), or one of a
number of closely related compounds, is an essential dietary nutrient for most, if not all,
animals, while many microorganisms (bacteria, fungi, algae, etc.) and plants are capable
of synthesizing their own vitamin B6 or compound(s) related to vitamin B6. When an
animal ingests PN or a related compound that has vitamin B6 activity, the compound is
converted ultimately into pyridoxal phosphate (PLP) and/or pyridoxamine phosphate
(PMP), which are the active forms of vitamin B6 in all living organisms. PLP acts as a
cofactor for many important or essential enzymes in all living organisms, including
transaminases, racemases, and decarboxylases. PLP and PMP are easily interconverted
by ubiquitous transaminases.

Vitamin B6 is of commercial importance in vitamin pills, pharmaceutical
applications, and as an animal feed additive that enhances growth or desirable growth
characteristics in farm and domestic animals. The currently used commercial process
for producing vitamin B6 is a synthetic chemical process. However, a fermentation
process using a microorganism (see US Patent application No. 09/667,569, filed
September 21, 2000, hereby incorporated in its entirety by reference) or a biosynthetic
process using a plant species can be more cost effective in the long run, and may be
environmentally more attractive.

The biosynthetic pathway for PLP in E. coli has been elucidated
(reviewed in Mittengruber, G., (2001) J. Mol. Microbiol. Biotechnol. 3(1): 1-20; Cane,
D.E., et al. (2000) J. Am. Chem. Soc. 122: 4213-4214; Man, T-K, et al., (1996) J.
Bacteriol. 178: 2445-2449). Enzymes encoded by the genes epd, pdxB, pdxF, and pdxA
lead to synthesis of the precursor 1-hydroxy-3-amino acetone phosphate from
erythrose-4-phosphate and glutamate. The enzyme encoded by dxs leads to the precursor
5'-deoxyxylulose phosphate from glycolytic intermediates. The enzyme encoded by pdxJ
then catalyzes the chemical coupling of the two precursors to give pyridoxol phosphate
(also called pyridoxine phosphate or PNP). PNP is then oxidized to the active form,
PLP, by the enzyme encoded by pdxH. This biosynthetic pathway to PLP in E. coli, as
well as closely related pathways, is referred to herein as the Type A Pathway. Partially
characterized mutants of E. coli have been described that produce about three- to
seven-fold more vitamin B6-related compounds than the parent strain (Dempsey and
Arcement (1971) J. Bacteriol. 107(2): 580-582). Partially characterized mutants of
B. subtilis have been reported that produce 1-5 mg/l vitamin B6, but it was not stated
what level the parent strain produced (Pflug, W., and Lingens, F., (1978) Hoppe-Seyler's
Z. Physiol. Chem. 359: 559-570). Notably, these organisms were not recombinantly produced.
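
The Type A Pathway steps described above can be summarized as a small data structure.
The following Python sketch is purely illustrative: the names TYPE_A_STEPS and
genes_required_for are hypothetical and do not appear in this application, and the
steps are recorded only at the level of detail given in the text.

```python
# Illustrative encoding of the Type A Pathway as described in the text.
# TYPE_A_STEPS and genes_required_for are hypothetical names chosen for this sketch.
TYPE_A_STEPS = [
    {"genes": ("epd", "pdxB", "pdxF", "pdxA"),
     "substrates": ("erythrose-4-phosphate", "glutamate"),
     "product": "1-hydroxy-3-amino acetone phosphate"},
    {"genes": ("dxs",),
     "substrates": ("glycolytic intermediates",),
     "product": "5'-deoxyxylulose phosphate"},
    {"genes": ("pdxJ",),
     "substrates": ("1-hydroxy-3-amino acetone phosphate",
                    "5'-deoxyxylulose phosphate"),
     "product": "PNP"},
    {"genes": ("pdxH",),
     "substrates": ("PNP",),
     "product": "PLP"},
]


def genes_required_for(product, steps=TYPE_A_STEPS):
    """Return every gene upstream of `product`, walking substrates backward."""
    by_product = {step["product"]: step for step in steps}
    needed = []
    stack = [product]
    while stack:
        step = by_product.get(stack.pop())
        if step is None:
            continue  # an external input such as glutamate, not made by a listed step
        for gene in step["genes"]:
            if gene not in needed:
                needed.append(gene)
        stack.extend(step["substrates"])
    return needed
```

Under these assumptions, genes_required_for("PLP") collects all seven genes named in
the text, while genes_required_for("PNP") omits pdxH.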

A second biosynthetic pathway for vitamin B6, referred to herein as the
Type B pathway, may exist in some organisms other than E. coli (Mittengruber, G.,
(2001) J. Mol. Microbiol. Biotechnol. 3(1): 1-20). In particular, some fungi (for example
from the genera Cercospora, Neurospora, Aspergillus and Saccharomyces), some
bacteria (for example B. subtilis and Staphylococcus aureus), and all plants for which
data exist do not contain any genes that are highly homologous to E. coli pdxA and
pdxJ. Instead, these organisms contain genes that are homologous to Cercospora genes
named SOR (or SNZ) and SNO. In Saccharomyces, these homologs are called PDX1 and
PDX2, respectively, and in B. subtilis, these homologs are named yaaD and yaaE,
respectively. Protein or DNA sequence homology alone is not sufficient to establish
biological function. For example, B. subtilis contains a gene, yhaF, that encodes a
protein that is significantly homologous to E. coli pdxF. However, when yhaF is
mutated, the resulting mutant B. subtilis strain is a serine auxotroph, but not a PL
auxotroph (see Example 3, below). Thus, the identification of a gene or genes involved
in PLP biosynthesis in any given organism cannot be done using sequence homology
alone. However, the identification of sequence homology between genes, in
combination with the presence of one or more common biological activities, may be
used to identify homologs of genes involved in PLP biosynthesis.

Results from 13C and 15N labeling studies suggest that the precursors that
provide the carbon and nitrogen atoms in PL and related compounds are different in E.
coli and Saccharomyces cerevisiae (Gupta, R., et al. (2001) J. Am. Chem. Soc. 123:
11353-11359; Tayuza, K., et al. (1995) Biochim. Biophys. Acta 1244: 113-116).
However, the identity of the precursors for PL and related compounds in S. cerevisiae is
not yet known. Since most microorganisms for which the entire genome sequence is
known (for example E. coli, S. cerevisiae and B. subtilis) have either pdxA and pdxJ
homologs or SOR and SNO homologs, but not both, it appears that most organisms that
are capable of synthesizing PLP have either the well characterized Type A Pathway (for
example E. coli, Salmonella typhimurium, and many other genera), or a distinctly
different and incompletely characterized pathway, e.g., the Type B Pathway.
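
The genome-level observation above (pdxA and pdxJ homologs versus SOR and SNO
homologs, but not both) can be expressed as a simple lookup. The Python sketch below
is a hypothetical illustration only: the function name and the return labels are
assumptions, the input is assumed to be a list of gene names already annotated as
homologs, and, as the text stresses, sequence homology alone does not establish
biological function.

```python
# Hypothetical sketch: assign a tentative pathway type from annotated gene names.
# The gene-name groupings follow the text; everything else is an assumption.
TYPE_A_MARKERS = {"pdxa", "pdxj"}
SOR_FAMILY = {"sor", "snz", "pdx1", "yaad"}   # SOR (or SNZ) and its named homologs
SNO_FAMILY = {"sno", "pdx2", "yaae"}          # SNO and its named homologs


def classify_b6_pathway(gene_names):
    """Return 'Type A', 'Type B', 'both', or 'undetermined' for a set of gene names."""
    genes = {name.lower() for name in gene_names}
    has_type_a = TYPE_A_MARKERS <= genes
    has_type_b = bool(genes & SOR_FAMILY) and bool(genes & SNO_FAMILY)
    if has_type_a and has_type_b:
        return "both"
    if has_type_a:
        return "Type A"
    if has_type_b:
        return "Type B"
    return "undetermined"
```

A tentative call of this kind would still need to be confirmed by a biological
activity, for example by testing a deletion mutant for PL auxotrophy.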

Specifically, members of the genera Cercospora, Neurospora, Aspergillus,
Saccharomyces, Bacillus, Arabidopsis, and many other genera, appear to have a Type B
pathway, and are lacking some of the genes involved in the Type A Pathway. The
intermediate compounds in the Type B Pathway have not yet been elucidated, although
the final product must be PLP (as for the Type A Pathway) or PMP, since these are the
active forms of vitamin B6 in all known organisms.



Summary of the Invention 

The present invention is based, at least in part, on the discovery of key
enzyme-encoding genes of the B6 vitamer biosynthetic pathways in, e.g., Bacillus
subtilis. In particular, the invention is based, at least in part, on the discovery that the
yaaD and yaaE genes of B. subtilis, or homologues thereof, are required for B6 vitamer
synthesis. The invention is further based on the discovery that the biosynthesis of
vitamers by an organism can be increased by increasing the level of one or more
enzymes involved in B6 vitamer synthesis.

Deletion of a portion of the yaaD and yaaE genes (which are adjacent in
an operon, e.g., the yaaDE operon) leads to PL auxotrophy. Overexpression of the
yaaDE operon or the deregulation of the expression of the yaaD and yaaE genes leads to
significantly increased production of B6 vitamers in, e.g., B. subtilis strains. The B.
subtilis yaaDE operon is required for pyridoxal phosphate (PLP) biosynthesis, an active
form of vitamin B6 in all living organisms. The present invention describes that the
expression of the B. subtilis yaaDE operon is a rate limiting step for production of
compounds related to vitamin B6 in a wild type strain.

Accordingly, the present invention features methods of producing B6
vitamers, including, but not limited to, pyridoxine (or pyridoxol (PN)), pyridoxal (PL),
pyridoxamine (PM), or the 5' phosphorylated derivatives of any of the three
aforementioned compounds (PNP, PLP, and PMP), using organisms in which the B6
vitamer pathway has been manipulated such that B6 vitamers are produced. Such
methods include culturing an organism, e.g., a microorganism that overexpresses at least
one B6 vitamer biosynthetic enzyme under conditions such that the B6 vitamer is
produced.

Based on the discovery of the activity of the polypeptides encoded by the
yaaD and yaaE genes, or homologues thereof, in B6 vitamer synthesis, the invention is
also based, at least in part, on the discovery that modulation of YaaD and/or YaaE
activity (e.g., YaaD and YaaE activity or YaaD activity) results in the modulation of B6
vitamer production. Accordingly, in one aspect, the invention includes a method for
producing a B6 vitamer comprising culturing an organism with an increased YaaD
and/or YaaE activity (e.g., YaaD and YaaE activity or YaaD activity) as compared to
the parent organism. In one embodiment, increased YaaD and/or YaaE activity (e.g.,
YaaD and YaaE activity or YaaD activity) is due to increased expression of a nucleic
acid molecule encoding a YaaD polypeptide and/or YaaE polypeptide (e.g., a YaaD
polypeptide and a YaaE polypeptide or a YaaD polypeptide) as compared to an
unmodified parent organism. In another embodiment, increased yaaD and/or yaaE
nucleic acid molecule expression (e.g., yaaD and yaaE nucleic acid expression or yaaD
nucleic acid expression) is due to the introduction of nucleic acid molecules encoding a
YaaD polypeptide and/or a YaaE polypeptide into the organism.

In another aspect, the invention provides methods for producing a B6
vitamer comprising culturing an organism with an increased Epd, PdxA, PdxJ, PdxF,
PdxB, PdxH, or Dxs activity as compared to the parent organism. In one embodiment,
increased Epd, PdxA, PdxJ, PdxF, PdxB, PdxH, or Dxs activity is due to increased
expression of a nucleic acid molecule encoding an Epd, PdxA, PdxJ, PdxF, PdxB, PdxH,
and/or a Dxs polypeptide as compared to an unmodified parent organism. In another
embodiment, increased epd, pdxA, pdxJ, pdxF, pdxB, pdxH, and/or dxs nucleic acid
molecule expression is due to the introduction of nucleic acid molecules encoding an
Epd, PdxA, PdxJ, PdxF, PdxB, PdxH, and/or a Dxs polypeptide into the organism.

The production methods of the present invention further can include
recovering the B6 vitamer.

The instant invention also features genetically modified organisms, e.g.,
microorganisms, (i.e., organisms that contain one or more modifications or mutations in
the genome) that are capable of producing significantly more of a B6 vitamer than an
unmodified parent organism. In particular, this invention features microorganisms
(including, for example, but not limited to bacteria, yeasts, fungi, and algae) or
macro-organisms such as plants that, when genetically modified, produce an increased
amount, e.g., at least about 10-fold more of a B6 vitamer, than the unmodified parent
organism. Specific examples are given herein in which Bacillus subtilis and Escherichia
coli strains have been genetically modified such that they produce significant amounts
of a B6 vitamer. Accordingly, the present invention features organisms that have been
genetically modified to increase the activity of one or more enzymes that catalyze(s) a
step in the biosynthesis of a B6 vitamer, such that B6 vitamer production from said
modified organism is increased compared to B6 production in an unmodified parent
organism.

Yet another aspect of the invention features recombinant organisms, e.g.,
microorganisms, which overexpress at least one Bacillus (e.g., B. subtilis) B6 vitamer
biosynthetic enzyme (e.g., at least one of the yaaD or yaaE gene products) or at least
one of the epd, pdxA, pdxJ, pdxF, pdxB, pdxH, and/or dxs gene products.
In one embodiment, the recombinant microorganism is Gram positive (e.g.,
microorganisms belonging to the genus Bacillus, Corynebacterium, Lactobacillus,
Lactococci or Streptomyces). In another embodiment, the recombinant microorganism
is Gram negative. Particularly preferred is a Bacillus recombinant microorganism (e.g.,
Bacillus licheniformis, Bacillus amyloliquefaciens, Bacillus subtilis, Bacillus pumilus,
Bacillus halodurans, and the like).

Recombinant vectors that contain genes encoding Bacillus B6 vitamer
biosynthetic enzymes, e.g., yaaD or yaaE genes, or homologues thereof, or epd, pdxA,
pdxJ, pdxF, pdxB, pdxH, or dxs genes, or homologues thereof, are also described.

Other features and advantages of the invention will be apparent from the 
following detailed description and claims. 


Brief Description of the Drawings 

Figure 1 depicts the chemical structures of vitamin B6 and related
compounds.

Figure 2 depicts the biosynthetic pathway for PLP in E. coli.

Figure 3 depicts the standard curves generated by Saccharomyces uvarum
strain ATCC 9080 after feeding serial dilutions of PN, PL, and PM (as described in
Example 1).

Figure 4 is a schematic representation of the plasmid pDX1F.

Figure 5 is a schematic representation of the plasmid pDX11F.

Figure 6 is a schematic representation of the plasmid pDX14R.

Figure 7 is a schematic representation of the plasmid pDX17R.

Detailed Description of the Invention 

The present invention is based, at least in part, on the identification of
Bacillus (e.g., B. subtilis) genes that encode essential enzymes of the B6 vitamer
biosynthetic pathway. In particular, the present invention features methods based on
manipulation of the B6 vitamer biosynthetic pathway in an organism, e.g., a
microorganism, such that certain desirable compounds are produced.

In particular, the invention is based, at least in part, on the discovery that
the yaaD and yaaE genes of B. subtilis are required for B6 vitamer synthesis, including,
but not limited to, pyridoxine (or pyridoxol (PN)), pyridoxal (PL), pyridoxamine (PM),
or the 5' phosphorylated derivatives of any of the three aforementioned compounds
(PNP, PLP, and PMP). The yaaD and yaaE genes are adjacent on an operon, e.g., the
yaaDE operon. The yaaD and yaaE genes encode the YaaD and YaaE proteins,
respectively. Accordingly, based on the discovery of the activity of the polypeptides
encoded by the yaaD and yaaE genes, or homologues thereof, in B6 vitamer synthesis,
the invention is also based, at least in part, on the discovery that modulation of YaaD
and/or YaaE activity results in the modulation of B6 vitamer production.

Accordingly, in one aspect, the invention includes a method for
producing a B6 vitamer comprising culturing an organism with an increased YaaD
and/or YaaE activity (e.g., YaaD and YaaE activity or YaaD activity) as compared to the
parent organism. In one embodiment, increased YaaD and/or YaaE activity (e.g., YaaD
and YaaE activity or YaaD activity) is due to increased expression of a nucleic acid
molecule encoding a YaaD polypeptide and/or a YaaE polypeptide (e.g., a YaaD
polypeptide and a YaaE polypeptide or a YaaD polypeptide) as compared to an
unmodified parent organism. In another embodiment, increased yaaD and/or yaaE
nucleic acid molecule expression (e.g., yaaD and yaaE nucleic acid expression or yaaD
nucleic acid expression) is due to the introduction of nucleic acid molecules encoding a
YaaD polypeptide and/or a YaaE polypeptide (e.g., a YaaD polypeptide and a YaaE
polypeptide or a YaaD polypeptide) into the organism. In still another embodiment, the
yaaD nucleic acid encodes a polypeptide comprising the amino acid sequence of SEQ
ID NO:21 and/or the yaaE nucleic acid encodes a polypeptide comprising the amino
acid sequence of SEQ ID NO:23. In still another embodiment, a yaaD nucleic acid
encodes a polypeptide comprising an amino acid sequence which is at least 30%
identical to the amino acid sequence of SEQ ID NO:21, the polypeptide having a YaaD
activity. In still another embodiment, a yaaE nucleic acid encodes a polypeptide
comprising an amino acid sequence which is at least 30% identical to the amino acid
sequence of SEQ ID NO:23, the polypeptide having a YaaE activity. In a further
embodiment, the nucleic acid molecule comprising the nucleotide sequence of SEQ ID
NO:20 and/or SEQ ID NO:22 is introduced. The nucleotide sequence of yaaD is set
forth as SEQ ID NO:20 and the nucleotide sequence of yaaE is set forth as SEQ ID
NO:22.

In another aspect, the invention provides methods for producing a B6
vitamer comprising culturing an organism with an increased Epd, PdxA, PdxJ, PdxF,
PdxB, PdxH, and/or Dxs activity as compared to the parent organism. In one
embodiment, increased Epd, PdxA, PdxJ, PdxF, PdxB, PdxH, and/or Dxs activity is due
to increased expression of a nucleic acid molecule encoding an Epd, PdxA, PdxJ, PdxF,
PdxB, PdxH, and/or a Dxs polypeptide as compared to an unmodified parent organism.
In another embodiment, increased epd, pdxA, pdxJ, pdxF, pdxB, pdxH, and/or dxs
nucleic acid molecule expression is due to the introduction of nucleic acid molecules
encoding an Epd, PdxA, PdxJ, PdxF, PdxB, PdxH, and/or a Dxs polypeptide into the
organism. In still another embodiment, the pdxA nucleic acid encodes a polypeptide
comprising the amino acid sequence of SEQ ID NO:25 and/or the pdxJ nucleic
acid encodes a polypeptide comprising the amino acid sequence of SEQ ID NO:27. In a
further embodiment, the pdxA nucleic acid encodes a polypeptide comprising an amino
acid sequence which is at least 30% identical to the amino acid sequence of SEQ ID
NO:25, the polypeptide having a PdxA activity. In still another embodiment, the pdxJ
nucleic acid encodes a polypeptide comprising an amino acid sequence which is at least
30% identical to the amino acid sequence of SEQ ID NO:27, and having a PdxJ activity.
In yet another embodiment, the nucleic acid molecule comprising the nucleotide
sequence of SEQ ID NO:24 and/or SEQ ID NO:26 is introduced. The nucleotide
sequence of pdxA is set forth as SEQ ID NO:24. The nucleotide sequence of pdxJ is set
forth as SEQ ID NO:26.
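
The "at least 30% identical" criterion recited above can be illustrated with a short
calculation. The Python sketch below is an assumption-laden example and not a method
defined in this passage: it assumes the two amino acid sequences have already been
aligned (with "-" marking gaps), and it adopts one common convention, identical
positions divided by total aligned positions. The function names are hypothetical.

```python
# Minimal sketch of a percent-identity check over a pre-computed pairwise alignment.
# Convention assumed here: identical columns / total alignment columns x 100.
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Return percent identity of two equal-length aligned sequences."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    if not aligned_a:
        raise ValueError("alignment is empty")
    identical = sum(
        1 for x, y in zip(aligned_a, aligned_b) if x == y and x != gap
    )
    return 100.0 * identical / len(aligned_a)


def meets_identity_threshold(aligned_a, aligned_b, threshold=30.0):
    """True if the aligned pair reaches the stated percent-identity threshold."""
    return percent_identity(aligned_a, aligned_b) >= threshold
```

As the embodiments above require, meeting the identity threshold is only one of two
conditions; the encoded polypeptide must also have the relevant activity (e.g., a YaaD
or PdxA activity), which no sequence comparison can establish.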

Overexpression of the yaaDE operon with a strong constitutive promoter
or the deregulation of the expression of the yaaD or yaaE gene(s) leads to significantly
increased production of B6 vitamers. These quantities are significantly higher relative
to the associated parent strains than those reported in previous studies, which have
employed mutant E. coli strains (Dempsey and Arcement (1971) J. Bacteriol. 107(2):
580-582), or mutant B. subtilis strains (Pflug, W., and Lingens, F., (1978)
Hoppe-Seyler's Z. Physiol. Chem. 359: 559-570).

Accordingly, the present invention features organisms, e.g.,
microorganisms, that have been genetically modified to increase the activity of one or
more enzymes that catalyze a step in the biosynthesis of a B6 vitamer, such that B6
vitamer production from the modified organism is increased compared to B6 production
in an unmodified parent organism. In one embodiment, B6 vitamer production is at
least ten-fold higher than from the unmodified parent organism. In another
embodiment, the organism is genetically modified to overexpress one or more genes that
encode an enzyme that catalyzes a step in the biosynthesis of a B6 vitamer, e.g., yaaD
or yaaE, or a homologue thereof, or epd, pdxA, pdxJ, pdxF, pdxB, pdxH, or dxs, or a
homologue thereof. The organism, e.g., microorganism, may be, for example, B.
subtilis or E. coli.

The present invention also features methods of producing a B6 vitamer
comprising culturing an organism, e.g., a microorganism, that has been genetically
modified to overexpress one or more genes that encode an enzyme that catalyzes a step
in the biosynthesis of a B6 vitamer, such that B6 vitamer production from said modified
organism is increased compared to B6 production in an unmodified parent organism,
under conditions such that the B6 vitamer is produced. The B6 vitamer may then be
recovered.

The terms "B6 vitamer" or "B6 vitamers," as used herein, shall refer to
any compound or mixture of compounds that has any biological activity in any
biological assay for vitamin B6. B6 vitamers include, but are not limited to, pyridoxine
(also called pyridoxol or PN), pyridoxal (PL), pyridoxamine (PM), the 5'
phosphorylated derivatives of any of the three aforementioned compounds (PNP, PLP,
and PMP), and any derivative or related compound that can be converted to the active
forms (PLP or PMP) in a test organism. Thus, for example, the acetate esters or other
esters of any of the available hydroxyl groups of any of the aforementioned six
compounds, and which are likely to be hydrolyzed by specific or non-specific esterases,
are included in B6 vitamers. Also, various salts, such as hydrochloride salts, of any of
the aforementioned compounds are included in B6 vitamers.

The term "B6 vitamer biosynthetic pathway" includes the biosynthetic
pathway involving B6 vitamer biosynthetic enzymes (e.g., polypeptides encoded by
biosynthetic enzyme-encoding genes), compounds (e.g., precursors, substrates,
intermediates or products), cofactors and the like utilized in the formation or synthesis
of B6 vitamers. The term "B6 vitamer biosynthetic pathway" includes the biosynthetic
pathway leading to the synthesis of B6 vitamers in a microorganism (e.g., in vivo) as
well as the biosynthetic pathway leading to the synthesis of B6 vitamers in vitro.

The term "B6 vitamer biosynthetic enzyme" includes an enzyme which is
involved in the synthesis of any B6 vitamer in an organism or microorganism, e.g., an
enzyme that is rate limiting for B6 vitamer synthesis in said organism or microorganism.
B6 vitamer biosynthetic enzymes include, for example, YaaD, YaaE, Epd, PdxA, PdxJ,
PdxF, PdxB, PdxH, and/or Dxs, or homologues thereof.

The term "B6 vitamer biosynthetic enzyme activity" includes, for
example, any enzyme activity which results in B6 vitamer synthesis. For example,
PdxA activity includes the catalysis of hydroxythreonine to hydroxyaminoacetone, and
PdxJ activity includes the catalysis of hydroxyaminoacetone and deoxyxylulose to
pyridoxol phosphate.



WO 2004/035010 PCT/US2003/008880


Overproduction or increased activity of the rate limiting enzyme for B6
vitamer production in any organism that is capable of producing B6 vitamers will lead to
overproduction of B6 vitamers. For example, overexpression or increased activity of
any of the E. coli genes involved in the PLP pathway will give a measurable increase in
B6 vitamer production. Specifically, overexpression or increased activity of the E. coli
genes epd, pdxA, pdxJ, pdxF, pdxB, pdxH, or dxs, homologues thereof, or a combination
thereof, in E. coli may lead to an increase in B6 vitamer production. Moreover,
overexpression or increased activity of genes of the type B pathway, e.g., the yaaD or
yaaE genes (e.g., the yaaD and yaaE genes or the yaaD gene) will result in increased B6
vitamer production in host organisms of the type A pathway, e.g., E. coli. Likewise,
overexpression or increased activity of genes of the type A pathway, e.g., the epd, pdxA,
pdxJ, pdxF, pdxB, pdxH, or dxs genes will result in increased B6 vitamer production in
host organisms of the type B pathway, e.g., B. subtilis.

A "biological assay for a B6 vitamer" includes, for example, any assay
that is capable of quantifying B6 vitamer activity by measuring growth of an organism
that requires the feeding of a B6 vitamer (i.e., a compound that the fed organism can
convert into PLP or PMP) for growth. Samples to be assayed are diluted serially in an
appropriate medium and fed to the appropriate organism. Standard curves are generated
by serially diluting known amounts of PL, PN, or PM, and feeding these dilutions to the
test organism. By comparing dilutions of the unknown samples to the standard curves,
total B6 vitamer activity can be determined, for example as PL equivalents if PL was
used to generate the standard curve.
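
The comparison of an unknown sample against a standard curve can be sketched
numerically. The Python example below is a minimal illustration under stated
assumptions: growth readings are assumed to rise monotonically with vitamer
concentration, linear interpolation between adjacent standards is assumed to be
adequate, and the function names, units, and example numbers are hypothetical and are
not data from this application.

```python
# Sketch: convert a growth reading from a diluted sample into PL equivalents.
# Each standard is a (PL concentration, growth reading) pair; numbers are made up.
EXAMPLE_STANDARDS = [(0.0, 0.05), (1.0, 0.25), (2.0, 0.45), (4.0, 0.85)]


def interpolate_concentration(standards, reading):
    """Linearly interpolate a concentration from (concentration, reading) pairs."""
    points = sorted(standards, key=lambda p: p[1])
    for (c0, r0), (c1, r1) in zip(points, points[1:]):
        if r0 <= reading <= r1:
            if r1 == r0:
                return c0
            return c0 + (c1 - c0) * (reading - r0) / (r1 - r0)
    raise ValueError("reading falls outside the standard curve")


def pl_equivalents(standards, reading, dilution_factor):
    """Scale the interpolated concentration back to the undiluted sample."""
    return interpolate_concentration(standards, reading) * dilution_factor
```

In practice, only dilutions whose readings fall within the range of the standard curve
would be used, which is why the sketch raises an error for out-of-range readings rather
than extrapolating.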

Various aspects of the invention are described in further detail in the 
following subsections.



I. Genes Encoding Various B6 Vitamer Biosynthetic Enzymes
In one embodiment, the present invention features targeting or modifying
various biosynthetic genes or enzymes of the B6 vitamer biosynthetic pathway. In
particular, the invention features modifying various enzymatic activities associated with
said pathways by modifying or altering the genes encoding said biosynthetic enzymes.

The term "gene", as used herein, includes a nucleic acid molecule (e.g., a
DNA molecule or segment thereof) that, in an organism, can be separated from another
gene or other genes, by intergenic DNA (i.e., intervening or spacer DNA which
naturally flanks the gene and/or separates genes in the chromosomal DNA of the
organism). Alternatively, a gene may slightly overlap another gene (e.g., the 3' end of a
first gene overlapping the 5' end of a second gene), said overlapping genes separated
from other genes by intergenic DNA. A gene may direct synthesis of an enzyme or
other protein molecule (e.g., may comprise coding sequences, for example, a contiguous
open reading frame (ORF) which encodes a protein) or may itself be functional in the
organism. A gene in an organism may be clustered in an operon, as defined herein, said
operon being separated from other genes and/or operons by the intergenic DNA. An
"isolated gene", as used herein, includes a gene which is essentially free of sequences
which naturally flank the gene in the chromosomal DNA of the organism from which
the gene is derived (i.e., is free of adjacent coding sequences which encode a second or
distinct protein, adjacent structural sequences or the like) and optionally includes 5' and
3' regulatory sequences, for example promoter sequences and/or terminator sequences.
In one embodiment, an isolated gene includes predominantly coding sequences for a
protein (e.g., sequences which encode Bacillus proteins). In another embodiment, an
isolated gene includes coding sequences for a protein (e.g., for a Bacillus protein) and
adjacent 5' and/or 3' regulatory sequences from the chromosomal DNA of the organism
from which the gene is derived (e.g., adjacent 5' and/or 3' Bacillus regulatory
sequences). Preferably, an isolated gene contains less than about 10 kb, 5 kb, 2 kb, 1 kb,
0.5 kb, 0.2 kb, 0.1 kb, 50 bp, 25 bp or 10 bp of nucleotide sequences that naturally flank
the gene in the chromosomal DNA of the organism from which the gene is derived.

The term "operon" includes at least two adjacent genes or ORFs,
optionally overlapping in sequence at either the 5' or 3' end of at least one gene or ORF.
The term "operon" includes a coordinated unit of gene expression that contains a
promoter and possibly a regulatory element associated with one or more adjacent genes
or ORFs (e.g., structural genes encoding enzymes, for example, biosynthetic enzymes).
Expression of the genes (e.g., structural genes) can be coordinately regulated, for
example, by regulatory proteins binding to the regulatory element or by anti-termination
of transcription. The genes of an operon (e.g., structural genes) can be transcribed to
give a single mRNA that encodes all of the proteins.

A "gene having a mutation" or "mutant gene" as used herein, includes a
gene having a nucleotide sequence which includes at least one alteration (e.g.,
substitution, insertion, deletion) such that the polypeptide or protein encoded by said
mutant exhibits an activity that differs from the polypeptide or protein encoded by the
wild-type nucleic acid molecule or gene. In one embodiment, a gene having a mutation
or mutant gene encodes a polypeptide or protein having an increased activity as
compared to the polypeptide or protein encoded by the wild-type gene, for example,
when assayed under similar conditions (e.g., assayed in microorganisms cultured at the
same temperature). As used herein, an "increased activity" or "increased enzymatic
activity" is one that is at least 5% greater than that of the polypeptide or protein encoded
by the wild-type nucleic acid molecule or gene, preferably at least 5-10% greater, more
preferably at least 10-25% greater and even more preferably at least 25-50%, 50-75% or
75-100% greater than that of the polypeptide or protein encoded by the wild-type
nucleic acid molecule or gene. Ranges intermediate to the above-recited values, e.g.,
75-85%, 85-90%, 90-95%, are also intended to be encompassed by the present
invention. As used herein, an "increased activity" or "increased enzymatic activity" can
also include an activity that is at least 1.25-fold greater than the activity of the
polypeptide or protein encoded by the wild-type gene, preferably at least 1.5-fold
greater, more preferably at least 2-fold greater and even more preferably at least 3-fold,
4-fold, 5-fold, 10-fold, 20-fold, 50-fold, 100-fold or greater than the activity of the
polypeptide or protein encoded by the wild-type gene.

In another embodiment, a gene having a mutation or mutant gene
encodes a polypeptide or protein having a reduced activity as compared to the
polypeptide or protein encoded by the wild-type gene, for example, when assayed under
similar conditions (e.g., assayed in microorganisms cultured at the same temperature).
A mutant gene also can encode no polypeptide or have a reduced level of production of
the wild-type polypeptide. As used herein, a "reduced activity" or "reduced enzymatic
activity" is one that is at least 5% less than that of the polypeptide or protein encoded by
the wild-type nucleic acid molecule or gene, preferably at least 5-10% less, more
preferably at least 10-25% less and even more preferably at least 25-50%, 50-75% or
75-100% less than that of the polypeptide or protein encoded by the wild-type nucleic
acid molecule or gene. Ranges intermediate to the above-recited values, e.g., 75-85%,
85-90%, 90-95%, are also intended to be encompassed by the present invention. As
used herein, a "reduced activity" or "reduced enzymatic activity" can also include an
activity that has been deleted or "knocked out" (e.g., approximately 100% less activity
than that of the polypeptide or protein encoded by the wild-type nucleic acid molecule
or gene).
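
The 5% thresholds that define "increased activity" and "reduced activity" above lend
themselves to a simple worked check. The Python sketch below is illustrative only: the
function names are hypothetical, both activities are assumed to have been measured
under similar conditions in the same units, and the "unchanged" label for differences
under 5% is an assumption of this sketch rather than a term defined in the text.

```python
# Sketch: compare a mutant activity to wild-type activity using the 5% thresholds.
def percent_change(mutant_activity, wild_type_activity):
    """Percent difference of the mutant relative to wild type (negative = lower)."""
    if wild_type_activity <= 0:
        raise ValueError("wild-type activity must be positive")
    return 100.0 * (mutant_activity - wild_type_activity) / wild_type_activity


def classify_activity(mutant_activity, wild_type_activity, threshold_pct=5.0):
    """Return 'increased', 'reduced', or 'unchanged' relative to wild type."""
    change = percent_change(mutant_activity, wild_type_activity)
    if change >= threshold_pct:
        return "increased"
    if change <= -threshold_pct:
        return "reduced"
    return "unchanged"
```

A knocked-out activity corresponds to a percent change of approximately -100, and a
1.25-fold increase corresponds to a percent change of 25.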

Activity can be determined according to any well-accepted assay for
measuring activity of a particular protein of interest. Activity can be measured or
assayed directly, for example, by measuring an enzymatic or biological activity of a protein
isolated or purified from a cell or microorganism. Alternatively, an activity can be
measured or assayed within a cell or microorganism or in an extracellular medium. For
example, assaying for a mutant gene (i.e., said mutant encoding a reduced enzymatic
activity) can be accomplished by expressing the mutated gene in a microorganism, for
example, a mutant microorganism in which the enzyme is temperature-sensitive, and
assaying the mutant gene for the ability to complement a temperature-sensitive (Ts)
mutant for enzymatic activity. A mutant gene that encodes an "increased enzymatic
activity" can be one that complements the Ts mutant more effectively than, for example,
a corresponding wild-type gene. A mutant gene that encodes a "reduced enzymatic
activity" is one that complements the Ts mutant less effectively than, for example, a
corresponding wild-type gene.

It will be appreciated by the skilled artisan that even a single substitution
in a nucleic acid or gene sequence (e.g., a base substitution that encodes an amino acid
change in the corresponding amino acid sequence) can dramatically affect the activity of
an encoded polypeptide or protein as compared to the corresponding wild-type
polypeptide or protein. A mutant gene (e.g., encoding a mutant polypeptide or protein),
as defined herein, is readily distinguishable from a nucleic acid or gene encoding a
protein homologue in that a mutant gene encodes a protein or polypeptide having an
altered activity, optionally observable as a different or distinct phenotype in an organism
expressing said mutant gene or producing said mutant protein or polypeptide (i.e., a
mutant microorganism) as compared to a corresponding organism expressing the
wild-type gene. By contrast, a protein homologue has an identical or substantially similar
activity, optionally phenotypically indiscernible when produced in an organism, e.g., a
microorganism, as compared to a corresponding organism, e.g., microorganism,
expressing the wild-type gene. Accordingly, it is not, for example, the degree of
sequence identity between nucleic acid molecules, genes, proteins or polypeptides that
serves to distinguish between homologues and mutants; rather, it is the activity of the
encoded protein or polypeptide that distinguishes between homologues and mutants:
homologues having, for example, low sequence identity (e.g., 30-50% sequence
identity) yet having substantially equivalent functional activities, and mutants, for
example, sharing 99% sequence identity yet having dramatically different or altered
functional activities.
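The distinction drawn above, namely that activity rather than sequence identity separates homologues from mutants, can be illustrated with a short sketch. The `Variant` record, the `classify` function and the 5% tolerance are hypothetical names and values chosen for illustration; the specification does not prescribe any particular numerical cut-off for "substantially similar" activity.

```python
from dataclasses import dataclass


@dataclass
class Variant:
    name: str
    percent_identity: float   # sequence identity to the wild-type gene, 0-100
    relative_activity: float  # variant activity / wild-type activity, same assay


def classify(variant: Variant, tolerance: float = 0.05) -> str:
    """Label a variant 'homologue' or 'mutant' from its activity alone.

    Sequence identity is deliberately not consulted: per the text, it is the
    activity of the encoded protein that distinguishes the two. The tolerance
    is an assumed stand-in for "substantially similar activity".
    """
    deviation = abs(variant.relative_activity - 1.0)
    return "mutant" if deviation >= tolerance else "homologue"


if __name__ == "__main__":
    distant = Variant("distant orthologue", percent_identity=35.0, relative_activity=1.0)
    point = Variant("single substitution", percent_identity=99.0, relative_activity=0.2)
    for v in (distant, point):
        print(v.name, "->", classify(v))
```

In this sketch a 35%-identical sequence with wild-type-like activity is labelled a homologue, whereas a 99%-identical sequence with strongly altered activity is labelled a mutant.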

As used herein, the term "homologue", e.g., a yaaD, yaaE, epd, pdxA,
pdxJ, pdxF, pdxB, pdxH, or dxs homologue, refers to a molecule having at least about
30%, 35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%,
97%, 98%, 99%, or more nucleotide or amino acid sequence identity to a yaaD, yaaE,
epd, pdxA, pdxJ, pdxF, pdxB, pdxH, or dxs nucleic acid or protein molecule and having a
specific functional activity, e.g., a yaaD activity, a yaaE activity, a pdxA activity, or a
pdxJ activity. A yaaD activity, a yaaE activity, a pdxA activity, or a pdxJ activity
includes the ability of the homologue to rescue an auxotroph, e.g., an organism which is
auxotrophic due to the mutation or deletion of a gene encoding the relevant enzyme
involved in a B6 biosynthetic pathway, in a test system, or the ability of the homologue
to produce a desired product, e.g., a B6 vitamer or intermediate in the B6 vitamer
biosynthetic pathway.
In one embodiment, methods for testing potential homologues, e.g.,
yaaD, yaaE, epd, pdxA, pdxJ, pdxF, pdxB, pdxH, or dxs homologues, for a yaaD, yaaE,
epd, pdxA, pdxJ, pdxF, pdxB, pdxH, or dxs functional activity include complementation
assays. For example, a complementation assay includes an assay wherein the potential
homologue, e.g., a potential homologue isolated from a plant organism or a
microorganism, is expressed in an auxotrophic Bacillus microorganism in which a
portion of the yaaD or yaaE gene has been deleted, and the ability of the potential
homologue to rescue the microorganism from auxotrophy is measured. In one
embodiment, a Bacillus promoter and ribosome binding site, such as, for example, a
ribosome binding site described herein, is utilized for the expression of the homologue
in the Bacillus microorganism. In another embodiment, the potential homologue is
expressed in an auxotrophic E. coli microorganism (for example, a pdxA, pdxB, pdxF,
pdxJ, dxs, or epd mutant), and the ability of the potential homologue to rescue the
microorganism from auxotrophy is measured.
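One way the outcome of such a complementation assay might be scored is sketched below. This is an assumption-laden illustration only: it supposes that growth is read as optical density in medium lacking B6 vitamers, that an empty-vector control and a wild-type-gene control are run in parallel, and that recovering half of the wild-type growth gain counts as rescue. None of these choices, nor the function names, come from the specification.

```python
def fraction_rescued(od_candidate: float,
                     od_empty_vector: float,
                     od_wild_type: float) -> float:
    """Fraction of the wild-type growth gain recovered by the candidate gene.

    All optical densities are assumed to be measured in medium lacking B6
    vitamers, using the same auxotrophic host strain.
    """
    window = od_wild_type - od_empty_vector
    if window <= 0:
        raise ValueError("controls do not separate; assay is uninformative")
    return (od_candidate - od_empty_vector) / window


def rescues_auxotrophy(od_candidate: float,
                       od_empty_vector: float,
                       od_wild_type: float,
                       min_fraction: float = 0.5) -> bool:
    """True when the candidate restores at least `min_fraction` of growth.

    The 0.5 default is a hypothetical cut-off chosen for illustration.
    """
    return fraction_rescued(od_candidate, od_empty_vector, od_wild_type) >= min_fraction


if __name__ == "__main__":
    print(rescues_auxotrophy(od_candidate=0.9, od_empty_vector=0.1, od_wild_type=1.1))
```

Normalizing against both controls keeps the score comparable between experiments with different baseline growth.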

A secondary assay for testing potential homologues, e.g., yaaD, yaaE,
epd, pdxA, pdxJ, pdxF, pdxB, pdxH, or dxs homologues, for a yaaD, yaaE, epd, pdxA,
pdxJ, pdxF, pdxB, pdxH, or dxs functional activity includes an assay which directly
measures the amount of desired product, e.g., a B6 vitamer, produced by the
recombinant organism. For example, the amount of B6 vitamer produced may be
measured by methods described herein, e.g., by HPLC analysis, and by other methods
known in the art.

A "potential homologue", as used herein, includes a molecule with 30%,
35%, 40%, 45%, 50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%,
98%, 99%, or more nucleotide or amino acid sequence identity to a yaaD, yaaE, epd,
pdxA, pdxJ, pdxF, pdxB, pdxH, or dxs nucleic acid or protein molecule, where a
functional activity of the molecule has not yet been established through, for example, an
assay described herein for measuring functional activity.
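As a rough illustration of how a candidate might be screened against the 30% identity floor recited above, the following sketch computes percent identity over two sequences that have already been aligned (gaps written as "-"). It is a simplified sketch under stated assumptions: the alignment itself is taken as given, identity is computed over ungapped columns only (other conventions for the denominator exist), and the function names are hypothetical.

```python
def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Percent identity over the ungapped columns of a pairwise alignment.

    Both inputs must come from the same alignment and so have equal length.
    Columns containing a gap in either sequence are skipped, which is one of
    several conventions in use and is an assumption of this sketch.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" or b == "-":
            continue
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        raise ValueError("no ungapped columns to compare")
    return 100.0 * matches / compared


def is_potential_homologue(aligned_query: str,
                           aligned_reference: str,
                           min_identity: float = 30.0) -> bool:
    """True when identity meets the floor; function remains to be tested."""
    return percent_identity(aligned_query, aligned_reference) >= min_identity


if __name__ == "__main__":
    print(percent_identity("ACGTA", "ACGAA"))
```

A sequence passing this screen remains only a potential homologue until a functional assay, such as the complementation assay described above, establishes its activity.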

In a preferred embodiment, the genes of the present invention are derived
from Bacillus. The term "derived from Bacillus" or "Bacillus-derived" includes a gene
which is naturally found in microorganisms of the genus Bacillus. In another preferred
embodiment, the genes of the present invention are derived from a microorganism
selected from the group consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus
lentus, Bacillus firmus, Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus
cereus, Bacillus circulans, Bacillus coagulans, Bacillus licheniformis, Bacillus
megaterium, Bacillus pumilus, Bacillus thuringiensis, Bacillus halodurans, and other
Group 1 Bacillus species, for example, as characterized by 16S rRNA type. In another
preferred embodiment, the gene is derived from Bacillus brevis or Bacillus
stearothermophilus. In another preferred embodiment, the genes of the present
invention are derived from a microorganism selected from the group consisting of
Bacillus licheniformis, Bacillus amyloliquefaciens, Bacillus subtilis, and Bacillus
pumilus. In a particularly preferred embodiment, the gene is derived from Bacillus
subtilis (e.g., is Bacillus subtilis-derived). The term "derived from Bacillus subtilis" or
"Bacillus subtilis-derived" includes a gene which is naturally found in the
microorganism Bacillus subtilis. Included within the scope of the present invention are
Bacillus-derived genes (e.g., B. subtilis-derived genes), for example, Bacillus or B.
subtilis yaaD or yaaE genes.

In a preferred embodiment, the genes of the present invention are derived
from Escherichia. The term "derived from Escherichia" or "Escherichia-derived"
includes a gene which is naturally found in microorganisms of the genus Escherichia.
In a particularly preferred embodiment, the gene is derived from Escherichia coli (e.g.,
is Escherichia coli-derived). The term "derived from Escherichia coli" or "Escherichia
coli-derived" includes a gene which is naturally found in the microorganism Escherichia
coli. Included within the scope of the present invention are Escherichia-derived genes
(e.g., Escherichia coli-derived genes), for example, Escherichia or Escherichia coli epd,
pdxA, pdxJ, pdxF, pdxB, pdxH, or dxs genes.

In another preferred embodiment, the genes of the present invention are
derived, for example, from any one of the organisms or microorganisms listed in Section
III or in either of Tables 9 or 10 (or in the GenBank records referred to by Accession
Nos. listed therein).

II. Recombinant Nucleic Acid Molecules and Vectors

The present invention further features recombinant nucleic acid
molecules (e.g., recombinant DNA molecules) that include genes described herein (e.g.,
isolated genes), preferably Bacillus genes, more preferably Bacillus subtilis genes, even
more preferably Bacillus subtilis B6 vitamer biosynthetic genes. The term "recombinant
nucleic acid molecule" includes an isolated nucleic acid molecule (e.g., a DNA
molecule) that has been altered, modified or engineered such that it differs in nucleotide
sequence from the native or natural nucleic acid molecule from which the recombinant
nucleic acid molecule was derived (e.g., by addition, deletion or substitution of one or
more nucleotides). Preferably, a recombinant nucleic acid molecule (e.g., a recombinant
DNA molecule) includes an isolated gene of the present invention operably linked to
regulatory sequences. The phrase "operably linked to regulatory sequence(s)" means
that the nucleotide sequence of the gene of interest is linked to the regulatory
sequence(s) in a manner which allows for expression (e.g., enhanced, increased,
constitutive, basal, attenuated, decreased or repressed expression) of the gene, preferably
expression of a gene product encoded by the gene (e.g., when the recombinant nucleic
acid molecule is included in a recombinant vector, as defined herein, and is introduced
into a microorganism). A "recombinant organism" is any organism that contains a
recombinant nucleic acid molecule.

The term "regulatory sequence" includes nucleic acid sequences that
affect (e.g., modulate or regulate) expression of other nucleic acid sequences (i.e.,
genes). In one embodiment, a regulatory sequence is included in a recombinant nucleic
acid molecule in a similar or identical position and/or orientation relative to a particular
gene of interest as is observed for the regulatory sequence and gene of interest as it
appears in nature, e.g., in a native position and/or orientation. For example, a gene of
interest can be included in a recombinant nucleic acid molecule operably linked to a
regulatory sequence which accompanies or is adjacent to the gene of interest in the
natural organism (e.g., operably linked to "native" regulatory sequences, e.g., to the
"native" promoter). Alternatively, a gene of interest can be included in a recombinant
nucleic acid molecule operably linked to a regulatory sequence which accompanies or is
adjacent to another (e.g., a different) gene in the natural organism. Alternatively, a gene
of interest can be included in a recombinant nucleic acid molecule operably linked to a
regulatory sequence from another organism. For example, regulatory sequences from
other microbes (e.g., other bacterial regulatory sequences, bacteriophage regulatory
sequences and the like) can be operably linked to a particular gene of interest.

In one embodiment, a regulatory sequence is a non-native or
non-naturally-occurring sequence (e.g., a sequence which has been modified, mutated,
substituted, derivatized, or deleted, including sequences which are chemically synthesized).
Preferred regulatory sequences include promoters, enhancers, termination signals,
anti-termination signals, ribosome binding sites and other expression control elements (e.g.,
sequences to which repressors or inducers bind and/or binding sites for transcriptional
and/or translational regulatory proteins, for example, in the transcribed mRNA). Such
regulatory sequences are described, for example, in Sambrook, J., Fritsch, E. F., and
Maniatis, T. Molecular Cloning: A Laboratory Manual. 2nd ed., Cold Spring Harbor
Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 1989.
Regulatory sequences include those which direct constitutive expression of a nucleotide
sequence in a microorganism (e.g., constitutive promoters and strong constitutive
promoters), those which direct inducible expression of a nucleotide sequence in a
microorganism (e.g., inducible promoters, for example, xylose inducible promoters) and
those which attenuate or repress expression of a nucleotide sequence in a microorganism
(e.g., attenuation signals or repressor sequences). It is also within the scope of the
present invention to regulate expression of a gene of interest by removing or deleting
regulatory sequences. For example, sequences involved in the negative regulation of
transcription can be removed such that expression of a gene of interest is enhanced.

In one embodiment, a recombinant nucleic acid molecule of the present
invention includes a nucleic acid sequence or gene that encodes at least one bacterial
gene product (e.g., a B6 vitamer biosynthetic enzyme, e.g., the gene product of yaaD
and/or yaaE, (e.g., the gene product of yaaD and yaaE or yaaD) or a homologue thereof)
operably linked to a promoter or promoter sequence. Preferred promoters of the present
invention include Bacillus promoters and/or bacteriophage promoters (e.g.,
bacteriophage which infect Bacillus). In one embodiment, a promoter is a Bacillus
promoter, preferably a strong Bacillus promoter (e.g., a promoter associated with a
biochemical housekeeping gene in Bacillus or a promoter associated with a glycolytic
pathway gene in Bacillus). In another embodiment, a promoter is a bacteriophage
promoter. In a preferred embodiment, the promoter is from the bacteriophage SPO1. In
a particularly preferred embodiment, a promoter is selected from the group consisting of
P15, P26 or Pveg, having, for example, the following respective sequences:
GCTATTGACGACAGCTATGGTTCACTGTCCACCAACCAAAACTGTGCTCAGT
ACCGCCAATATTTCTCCCTTGAGGGGTACAAAGAGGTGTCCCTAGAAGAGAT
CCACGCTGTGTAAAAATTTTA

GTAATAATTTAATTACAGGCGGGGGCAACCCCGCCTGT (SEQ ID NO:9),

GCCTACCTAGCTTCCAAGAAAGATATCCTAACAGCACAAGAGCGGAAAGAT
GTTTTGTTCTACATCCAGAACAA

AAAAAGTTGTTGACTTTATCTACAAGGTGTGGTATAATAATCTTAACAACAG
CAGGACGC (SEQ ID NO:10); and

GAGGAATCATAGAATTTTGTCAAAATAATTTTATT

GTTGATATAATTTAAATTTTATTTGACAAAAATGGGC^
AATGTAGTGAGGTGGATGCAATG (SEQ ID NO:11). Additional preferred
promoters include tef (the translational elongation factor (TEF) promoter) and pyc (the
pyruvate carboxylase (PYC) promoter), which promote high level expression in Bacillus
(e.g., Bacillus subtilis). Additional preferred promoters, for example, for use in Gram
positive microorganisms include, but are not limited to, amy and SPO2 promoters.
Additional preferred promoters, for example, for use in Gram negative microorganisms
include, but are not limited to, tac, trp, tet, trp-tet, lpp, lac, lpp-lac, lacIq, T7, T5, T3,
gal, trc, ara, SP6, λ-PR or λ-PL.

In another embodiment, a recombinant nucleic acid molecule of the
present invention includes a terminator sequence or terminator sequences (e.g.,
transcription terminator sequences). The term "terminator sequences" includes
regulatory sequences that serve to terminate transcription of mRNA. Terminator
sequences (or tandem transcription terminators) can further serve to stabilize mRNA
(e.g., by adding structure to mRNA), for example, against nucleases.

In yet another embodiment, a recombinant nucleic acid molecule of the
present invention includes sequences which allow for detection of the vector containing
said sequences (i.e., detectable and/or selectable markers), for example, genes that
encode antibiotic resistance or sequences that overcome auxotrophic mutations, for
example, trpC, fluorescent markers, drug markers, and/or colorimetric markers (e.g.,
lacZ/β-galactosidase). In yet another embodiment, a recombinant nucleic acid molecule
of the present invention includes an artificial ribosome binding site (RBS) or a sequence
that becomes transcribed into an artificial RBS. The term "artificial ribosome binding
site (RBS)" includes a site within an mRNA molecule (e.g., coded within DNA) to
which a ribosome binds (e.g., to initiate translation) which differs from a native RBS
(e.g., a RBS found in a naturally-occurring gene) by at least one nucleotide. Preferred
artificial RBSs include about 5-6, 7-8, 9-10, 11-12, 13-14, 15-16, 17-18, 19-20, 21-22,
23-24, 25-26, 27-28, 29-30 or more nucleotides of which about 1-2, 3-4, 5-6, 7-8, 9-10,
11-12, 13-15 or more differ from the native RBS (e.g., the native RBS of a gene of
interest, for example, the native yaaD RBS
GAAATCATATAACTATACCTTGATTAGGGGGACCAAGAAATG
(SEQ ID NO:12) or the native yaaE RBS
CAAGAACGCGGCTGGTAAGAACATAGGAGCGCTGCTGACATG (SEQ ID
NO:13)).

Preferably, nucleotides that differ are substituted such that they are
identical to one or more nucleotides of an ideal RBS when optimally aligned for
comparisons. Artificial RBSs can be used to replace the naturally-occurring or native
RBSs associated with a particular gene. Artificial RBSs preferably increase translation
of a particular gene. Preferred artificial RBSs (e.g., RBSs for increasing the translation
of yaaE, for example, of B. subtilis) are set forth in Table 1, below.

Table 1: Preferred Ribosome Binding Sites

Name          Sequence                                          SEQ ID NO:

Native_yaaD   ---GAAATCATATAACTATACCTTGATTAGGGGGACC-AAGAAATG    12
Native_yaaE   CAAGAACGCGGCTGGTAAGAACAT AGGAGCGCTGCTGACATG       13
IDEAL RBS     TCTAGAAAGG AGGTG A                                14
RBS1          TCTAGAAGG AGGAG AAAACATG                          15
RBS2          TCTAGAGG AGGAG AAAACATG                           16
RBS101        TAAGAACAA AGGAGGAGAGCTGACATG                      17
RBS103        TAAGAAGAA AGGAGGTGAGCTGACATG                      18
RBS102        TAAGAACAG AGGAGGAGAGCTGACATG                      19
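The notion of an artificial RBS differing from a native RBS "by at least one nucleotide" can be made concrete by counting mismatches between two sequences. The sketch below is illustrative only and rests on assumptions that are not stated in the specification: the spaces shown within the Table 1 sequences are treated as layout and removed, and RBS101 is compared against the native yaaE RBS by a simple ungapped alignment anchored at the shared ATG start codon at the 3' end. The function names are hypothetical.

```python
# Sequences transcribed from Table 1, with internal spaces removed (assumption).
NATIVE_YAAE_RBS = "CAAGAACGCGGCTGGTAAGAACATAGGAGCGCTGCTGACATG"  # SEQ ID NO:13
RBS101 = "TAAGAACAAAGGAGGAGAGCTGACATG"                          # SEQ ID NO:17


def count_mismatches(seq_a: str, seq_b: str) -> int:
    """Number of positions at which two equal-length sequences differ."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must have equal length")
    return sum(1 for a, b in zip(seq_a.upper(), seq_b.upper()) if a != b)


def differences_from_native(artificial: str, native: str) -> int:
    """Mismatches between an artificial RBS and the 3' end of a native RBS.

    The two sequences are right-aligned without gaps, so that their start
    codons coincide. This alignment choice is an assumption of the sketch,
    not the "optimal alignment" referred to in the text.
    """
    if len(artificial) > len(native):
        raise ValueError("artificial RBS is longer than the native sequence")
    native_tail = native[-len(artificial):]
    return count_mismatches(artificial, native_tail)


if __name__ == "__main__":
    n = differences_from_native(RBS101, NATIVE_YAAE_RBS)
    print(f"RBS101 differs from the native yaaE RBS at {n} positions")
```

A proper comparison would use an optimal pairwise alignment rather than the right-anchored comparison used here; the sketch is only meant to show what is being counted.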
The present invention further features vectors (e.g., recombinant vectors)
that include nucleic acid molecules (e.g., genes or recombinant nucleic acid molecules
comprising said genes) as described herein. The term "recombinant vector" includes a
vector (e.g., plasmid, phage, phasmid, virus, cosmid or other purified nucleic acid
vector) that has been altered, modified or engineered such that it contains greater, fewer
or different nucleic acid sequences than those included in the native or natural nucleic
acid molecule from which the recombinant vector was derived. Preferably, the
recombinant vector includes a biosynthetic enzyme-encoding gene or recombinant
nucleic acid molecule including said gene, operably linked to regulatory sequences, for
example, promoter sequences, terminator sequences and/or artificial ribosome binding
sites (RBSs), as defined herein. In another embodiment, a recombinant vector of the
present invention includes sequences that enhance replication in bacteria (e.g.,
replication-enhancing sequences). In one embodiment, replication-enhancing sequences
are derived from E. coli. In another embodiment, replication-enhancing sequences are
derived from pBR322. In another embodiment, replication-enhancing sequences are
derived from pSC101.

In yet another embodiment, a recombinant vector of the present invention
includes antibiotic resistance sequences. The term "antibiotic resistance sequences"
includes sequences which promote or confer resistance to antibiotics on the host
organism (e.g., Bacillus). In one embodiment, the antibiotic resistance sequences are
selected from the group consisting of cat (chloramphenicol resistance) sequences, tet
(tetracycline resistance) sequences, erm (erythromycin resistance) sequences, neo
(neomycin resistance) sequences, kan (kanamycin resistance) sequences and spec
(spectinomycin resistance) sequences. Recombinant vectors of the present invention can
further include homologous recombination sequences (e.g., sequences designed to allow
recombination of the gene of interest into the chromosome of the host organism). For
example, bpr, vpr, and/or amyE sequences can be used as homology targets for
recombination into the host chromosome. It will further be appreciated by one of skill
in the art that the design of a vector can be tailored depending on such factors as the
choice of microorganism to be genetically engineered, the level of expression of gene
product desired and the like.



III. Recombinant Organisms

The present invention further features organisms, i.e., recombinant
organisms, e.g., recombinant microorganisms, that include vectors or genes (e.g.,
wild-type and/or mutated genes) as described herein. As used herein, the term "recombinant
organism" includes an organism or a microorganism (e.g., bacteria, yeast cell, fungal
cell, plant cell, etc.) that has been genetically altered, modified or engineered (e.g.,
genetically engineered) such that it exhibits an altered, modified or different genotype
and/or phenotype (e.g., when the genetic modification affects coding nucleic acid
sequences of the organism) as compared to the naturally-occurring organism from which
it was derived, e.g., a wild type or unmodified organism or parent organism or host
organism.

Possible host organisms include, for example, plants, algae, fungi, yeasts,
and other organisms and microorganisms. The plant may be a monocot, dicot or
gymnosperm. Preferred microbes are those used in agriculture or by industry, and those
that are pathogenic for plants or animals. Plants include arabidopsis; field crops (e.g.,
alfalfa, barley, bean, cereals, corn, cotton, flax, lucerne, hemp, millet, oats, pea, rape,
rice, rye, safflower, sorghum, soybean, sunflower, tobacco, triticale, and wheat);
vegetable crops (e.g., asparagus, beet, broccoli, cabbages, capsicum, carrot, cauliflower,
celery, cucumber, eggplant, lettuces, onion, pepper, potato, pumpkin, maize, radish,
spinach, squash, taro, tomato, and zucchini); fruit and grapevine and nut species (e.g.,
almond, apple, apricot, banana, blackberry, blueberry, cacao, cherry, coconut,
cranberry, date, feijoa, filbert, grape, grapefruit, guava, kiwi, lemon, lime, mango,
melon, nectarine, orange, papaya, passion fruit, peach, peanut, pear, pineapple,
pistachio, plum, raspberry, strawberry, tangerine, walnut, and watermelon);
Brassicaceae such as oilseed, canola, sugarbeet, sugarcane; and ornamentals (e.g., alder,
ash, aspen, azalea, birch, boxwood, camellia, carnation, chrysanthemum, elm, fir, ivy,
jasmine, juniper, oak, palm, poplar, pine, redwood, rhododendron, rose, rubber, and
yew).

Fungi which may be used as host organisms include organisms in both
the mold and yeast morphologies. Possible fungal host organisms include the phylum
Ascomycota, including, for example, the subphyla Neolectomycetes, Pezizomycotina,
Pneumocystidomycetes, Saccharomycotina, Schizosaccharomycetes, Taphrinomycetes,
mitosporic Ascomycota, and unclassified Ascomycota; the phylum Basidiomycota,
including, for example, the subphyla Hymenomycetes, Urediniomycetes,
Ustilaginomycetes, mitosporic Basidiomycota, and unclassified Basidiomycota; the
phylum Chytridiomycota, including, for example, the subphyla, Blastocladiales,
Chytridiales, Monoblepharidales, Neocallimasticales, Spizellomycetales, and
unclassified Chytridiomycota; the phylum Glomeromycota, including, for example, the
subphylum Glomeromycetes; the phylum Microsporidia, including, for example, the
subphyla, Apansporoblastina, Endoreticulatus, Gurleyidae, Nadelsporidae,
Ordosporidae, Pansporoblastina, Pereziidae, Pseudonosematidae, Tetramicriidae, and
unclassified Microsporidia; the phylum Zygomycota, including, for example, the




subphyla, Trichomycetes, Zygomycetes, and unclassified Zygomycota and unclassified 
Fungi. 

Algae which may be used as host organisms include, for example, green
algae, red algae, yellow-green algae, Cryptomonads, haptophytes, golden algae,
euglenids, diatoms, and brown algae.

Other organisms, e.g., microorganisms, which may be used as host
organisms include Gram negative bacteria (e.g., a microorganism which excludes basic
dye, for example, crystal violet) and Gram positive bacteria (e.g., a microorganism
which retains basic dye, for example, crystal violet, due to the presence of a
Gram-positive wall surrounding the microorganism). Gram negative bacteria include, for
example, Acetobacteriaceae, Alcaligenaceae, Bacteroidaceae, Chromatiaceae,
Enterobacteriaceae, Legionellaceae, Neisseriaceae, Nitrobacteriaceae,
Pseudomonadaceae, Rhizobiaceae, Rickettsiaceae, Spirochaetaceae, and Vibrionaceae.
Gram positive bacteria include, for example, Bacillaceae, Micrococcaceae, and
Peptococcaceae.

In a preferred embodiment, the recombinant microorganism is a Gram
positive microorganism belonging to a genus selected from the group consisting of
Bacillus, Corynebacterium, Lactobacillus, Lactococci and Streptomyces. In a more
preferred embodiment, the recombinant microorganism is of the genus Bacillus. In
another preferred embodiment, the recombinant microorganism is selected from the
group consisting of Bacillus subtilis, Bacillus lentimorbus, Bacillus lentus, Bacillus
firmus, Bacillus pantothenticus, Bacillus amyloliquefaciens, Bacillus cereus, Bacillus
circulans, Bacillus coagulans, Bacillus licheniformis, Bacillus megaterium, Bacillus
pumilus, Bacillus thuringiensis, Bacillus halodurans, and other Group 1 Bacillus
species, for example, as characterized by 16S rRNA type. In another preferred
embodiment, the recombinant microorganism is Bacillus brevis or Bacillus
stearothermophilus. In another preferred embodiment, the recombinant microorganism
is selected from the group consisting of Bacillus licheniformis, Bacillus
amyloliquefaciens, Bacillus subtilis, and Bacillus pumilus.

In another preferred embodiment, the recombinant microorganism is a
Gram negative microorganism belonging to a genus selected from the group consisting
of Salmonella, Escherichia, Klebsiella, Serratia, and Proteus. In a more preferred
embodiment, the recombinant microorganism is of the genus Escherichia. In an even
more preferred embodiment, the recombinant microorganism is Escherichia coli. In
another embodiment, the recombinant microorganism is Saccharomyces (e.g., S.
cerevisiae).
A preferred "recombinant" organism, e.g., microorganism of the present
invention is an organism, e.g., a microorganism having a deregulated B6 vitamer 
biosynthesis pathway or enzyme. The term "deregulated" or "deregulation" includes the 
alteration or modification of at least one gene in an organism, e.g., a microorganism that 
encodes an enzyme in a biosynthetic pathway, such that the level or activity of the 
biosynthetic enzyme in the organism, e.g., the microorganism is altered or modified. 
Preferably, at least one gene that encodes an enzyme in a biosynthetic pathway is altered 
or modified such that the gene product is enhanced or increased. The phrase 
"deregulated pathway" can also include a biosynthetic pathway in which more than one 
gene that encodes an enzyme in a biosynthetic pathway is altered or modified such that 
the level or activity of more than one biosynthetic enzyme is altered or modified. The 
ability to "deregulate" a pathway (e.g., to simultaneously deregulate more than one gene 
in a given biosynthetic pathway) in an organism, e.g., a microorganism in some cases 
arises from the particular phenomenon of organisms, e.g., microorganisms in which 
more than one enzyme (e.g., two or three biosynthetic enzymes) are encoded by genes 
occurring adjacent to one another on a contiguous piece of genetic material termed an 
"operon" (defined herein). Due to the coordinated regulation of genes included in an 
operon, alteration or modification of the single promoter and/or regulatory element can 
result in alteration or modification of the expression of more than one gene product 
encoded by the operon. Alteration or modification of the regulatory element can 
include, but is not limited to removing the endogenous promoter and/or regulatory 
element(s), adding strong promoters, inducible promoters or multiple promoters or 
removing regulatory sequences such that expression of the gene products is modified, 
modifying the chromosomal location of the operon, altering nucleic acid sequences 
adjacent to the operon or within the operon such as a ribosome binding site, increasing 
the copy number of the operon, modifying proteins (e.g., regulatory proteins, 
suppressors, enhancers, transcriptional activators and the like) involved in transcription 
of the operon and/or translation of the gene products of the operon, or any other 
conventional means of deregulating expression of genes routine in the art (including but 
not limited to use of antisense nucleic acid molecules, for example, to block expression 
of repressor proteins). Deregulation can also involve altering the coding region of one 
or more genes to yield, for example, an enzyme that is feedback resistant or has a higher 
or lower specific activity. 

In another preferred embodiment, a recombinant organism, e.g.,
microorganism is designed or engineered such that at least one B6 vitamer biosynthetic
enzyme is overexpressed. The term "overexpressed" or "overexpression" includes
expression of a gene product (e.g., a biosynthetic enzyme) at a level greater than that
expressed prior to manipulation of the microorganism or in a comparable organism, e.g.,
microorganism which has not been manipulated. In one embodiment, the organism,
e.g., the microorganism can be genetically designed or engineered to overexpress a level
of gene product greater than that expressed in a comparable organism, e.g.,
microorganism which has not been engineered.

Genetic engineering can include, but is not limited to, altering or
modifying regulatory sequences or sites associated with expression of a particular gene
(e.g., by adding strong promoters, inducible promoters or multiple promoters or by
removing regulatory sequences such that expression is constitutive), modifying the
chromosomal location of a particular gene, altering nucleic acid sequences adjacent to a
particular gene such as a ribosome binding site, increasing the copy number of a
particular gene, modifying proteins (e.g., regulatory proteins, suppressors, enhancers,
transcriptional activators and the like) involved in transcription of a particular gene
and/or translation of a particular gene product, or any other conventional means of
deregulating expression of a particular gene routine in the art (including but not limited
to use of antisense nucleic acid molecules, for example, to block expression of repressor
proteins). Genetic engineering can also include deletion of a gene, for example, to block
a pathway or to remove a repressor. In embodiments featuring organisms having
deleted genes, the skilled artisan will appreciate that at least low levels of certain
compounds may be required to be present in or added to the culture medium in order
that the viability of the organism is not compromised. Often, such low levels are present
in complex culture media as routinely formulated. Moreover, in processes featuring
organisms having deleted genes cultured under conditions such that
commercially or industrially attractive quantities of product are produced, it may be
necessary to supplement culture media with slightly increased levels of certain
compounds.

In another embodiment, the microorganism can be physically or
environmentally manipulated to overexpress a level of gene product greater than that
expressed prior to manipulation of the organism or in a comparable microorganism
which has not been manipulated. For example, an organism can be treated with or
cultured in the presence of an agent known or suspected to increase transcription of a
particular gene and/or translation of a particular gene product such that transcription
and/or translation are enhanced or increased. Alternatively, an organism can be cultured
at a temperature selected to increase transcription of a particular gene and/or translation
of a particular gene product such that transcription and/or translation are enhanced or
increased.
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IV. Culturine and Fermenting Recombin ant Microorganisms 
The term "culturing" includes maintaining and/or growing a living
microorganism of the present invention (e.g., maintaining and/or growing a culture or
strain). In one embodiment, a microorganism of the invention is cultured in liquid
media. In another embodiment, a microorganism of the invention is cultured in solid
media or semi-solid media. In a preferred embodiment, a microorganism of the
invention is cultured in media (e.g., a sterile, liquid media) comprising nutrients
essential or beneficial to the maintenance and/or growth of the microorganism (e.g.,
carbon sources or carbon substrate, for example carbohydrate, hydrocarbons, oils, fats,
fatty acids, organic acids, and alcohols; nitrogen sources, for example, peptone, yeast
extracts, meat extracts, malt extracts, soy flour, urea, ammonium sulfate, ammonium
chloride, ammonium nitrate and ammonium phosphate; phosphorus sources, for
example, phosphoric acid, sodium and potassium salts thereof; trace elements, for
example, magnesium, iron, manganese, calcium, copper, zinc, boron, molybdenum,
and/or cobalt salts; as well as growth factors such as amino acids, vitamins, and the
like).

Preferably, microorganisms of the present invention are cultured under
controlled pH. The term "controlled pH" includes any pH which results in production of
the desired product. In one embodiment, microorganisms are cultured at a pH of about 7.
In another embodiment, microorganisms are cultured at a pH of between 6.0 and 8.5.
The desired pH may be maintained by any number of methods known to those skilled in
the art.

Also preferably, microorganisms of the present invention are cultured
under controlled aeration. The term "controlled aeration" includes sufficient aeration
(e.g., oxygen) to result in production of the desired product. In one embodiment,
aeration is controlled by regulating oxygen levels in the culture, for example, by
regulating the amount of oxygen dissolved in culture media. Preferably, aeration of the
culture is controlled by agitating the culture. Agitation may be provided by a propeller
or similar mechanical agitation equipment, by revolving or shaking the culture vessel
(e.g., tube or flask) or by various pumping equipment. Aeration may be further
controlled by the passage of sterile air or oxygen through the medium (e.g., through the
fermentation mixture). Also preferably, microorganisms of the present invention are
cultured without excess foaming (e.g., via addition of antifoaming agents).

Moreover, microorganisms of the present invention can be cultured under
controlled temperatures. The term "controlled temperature" includes any temperature
which results in production of the desired product (e.g., a B6 vitamer). In one
embodiment, controlled temperatures include temperatures between 15°C and 95°C. In
another embodiment, controlled temperatures include temperatures between 15°C and
70°C. Preferred temperatures are between 20°C and 55°C, more preferably between
30°C and 50°C.
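
The pH and temperature ranges recited above can be written down as a small lookup, which may help when checking a planned culture condition against the described embodiments. The following is only an illustrative sketch: the function names and tier labels are hypothetical, and the range endpoints are assumed to be inclusive, which the text does not state.

```python
# Ranges quoted from the text above; names, labels and inclusive endpoints are assumptions.
PH_RANGE = (6.0, 8.5)

# Ordered from narrowest to broadest so the first match is the tightest tier.
TEMPERATURE_RANGES_C = [
    ("more preferred", 30.0, 50.0),
    ("preferred", 20.0, 55.0),
    ("another embodiment", 15.0, 70.0),
    ("one embodiment", 15.0, 95.0),
]


def ph_in_range(ph):
    """Return True if the pH lies within the 6.0 to 8.5 embodiment."""
    low, high = PH_RANGE
    return low <= ph <= high


def temperature_tier(temp_c):
    """Return the label of the narrowest recited range containing temp_c, or None."""
    for label, low, high in TEMPERATURE_RANGES_C:
        if low <= temp_c <= high:
            return label
    return None


if __name__ == "__main__":
    print(ph_in_range(7.0), temperature_tier(37.0))
    print(temperature_tier(60.0))
```

For instance, 37°C and 43°C, the two temperatures used later in the shake flask examples, both fall in the narrowest range.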

Microorganisms can be cultured (e.g., maintained and/or grown) in liquid
media and preferably are cultured, either continuously or intermittently, by conventional
culturing methods such as standing culture, test tube culture, shaking culture (e.g., rotary
shaking culture, shake flask culture, etc.), aeration spinner culture, or fermentation. In a
preferred embodiment, the microorganisms are cultured in shake flasks. In a more
preferred embodiment, the microorganisms are cultured in a fermentor (e.g., a
fermentation process). Fermentation processes of the present invention include, but are
not limited to, batch, fed-batch and continuous processes or methods of fermentation.
The phrase "batch process" or "batch fermentation" refers to a closed system in which
the composition of media, nutrients, supplemental additives and the like is set at the
beginning of the fermentation and not subject to alteration during the fermentation;
however, attempts may be made to control such factors as pH and oxygen concentration
to prevent excess media acidification and/or microorganism death. The phrase
"fed-batch process" or "fed-batch fermentation" refers to a batch fermentation with the
exception that one or more substrates or supplements are added (e.g., added in
increments or continuously) as the fermentation progresses. The phrase "continuous
process" or "continuous fermentation" refers to a system in which a defined
fermentation media is added continuously to a fermentor and an equal amount of used or
"conditioned" media is simultaneously removed, preferably for recovery of the desired
product (e.g., a B6 vitamer). A variety of such processes have been developed and are
well-known in the art.
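
The distinction between the three process types can be illustrated by tracking the liquid volume in the vessel. This is a minimal sketch under stated assumptions: the function names are hypothetical, feed is taken as a constant rate, and small additions made for pH control in a batch process are ignored.

```python
def broth_volume_l(mode, initial_volume_l, feed_l_per_h, hours):
    """Liquid volume in the vessel after `hours` for each process type."""
    if mode == "batch":
        # Closed system: composition and volume are set at the start.
        return initial_volume_l
    if mode == "fed-batch":
        # Substrate is added as the fermentation progresses; nothing is removed.
        return initial_volume_l + feed_l_per_h * hours
    if mode == "continuous":
        # Fresh medium in, an equal amount of conditioned medium out.
        return initial_volume_l
    raise ValueError(f"unknown process mode: {mode}")


def dilution_rate_per_h(feed_l_per_h, volume_l):
    """Dilution rate of a continuous process: feed rate divided by working volume."""
    return feed_l_per_h / volume_l


if __name__ == "__main__":
    for mode in ("batch", "fed-batch", "continuous"):
        print(mode, broth_volume_l(mode, 10.0, 0.5, 24))
    print(dilution_rate_per_h(0.5, 10.0))
```

Only the fed-batch volume grows over time; in the continuous case the removed stream is what would be sent on for product recovery.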

The phrase "culturing under conditions such that a desired compound is
produced" includes maintaining and/or growing microorganisms under conditions (e.g.,
temperature, pressure, pH, duration, etc.) appropriate or sufficient to obtain production
of the desired compound or to obtain desired yields of the particular compound being
produced. For example, culturing is continued for a time sufficient to produce the
desired amount of a compound (e.g., a B6 vitamer). Preferably, culturing is continued
for a time sufficient to substantially reach suitable production of the compound (e.g., a
time sufficient to reach a suitable concentration of a B6 vitamer). In one embodiment,
culturing is continued for about 12 to 24 hours. In another embodiment, culturing is
continued for about 24 to 36 hours, 36 to 48 hours, 48 to 72 hours, 72 to 96 hours, 96 to
120 hours, 120 to 144 hours, or greater than 144 hours. The methodology of the present
invention can further include a step of recovering a desired compound (e.g., a B6
vitamer). The term "recovering" a desired compound includes extracting, harvesting,
isolating or purifying the compound from culture media. Recovering the compound can
be performed according to any conventional isolation or purification methodology
known in the art including, but not limited to, treatment with a conventional resin (e.g.,
anion or cation exchange resin, non-ionic adsorption resin, etc.), treatment with a
conventional adsorbent (e.g., activated charcoal, silicic acid, silica gel, cellulose,
alumina, etc.), alteration of pH, solvent extraction (e.g., with a conventional solvent such
as an alcohol, ethyl acetate, hexane and the like), dialysis, filtration, concentration,
crystallization, recrystallization, pH adjustment, lyophilization and the like. For
example, a compound can be recovered from culture media by first removing the
microorganisms from the culture. The resulting solutions are then passed through or
over a cation exchange resin to remove cations and/or through or over an anion
exchange resin to purify or concentrate the desired product. The resulting compound can
subsequently be converted to a salt (e.g., a chloride or sulfate salt) by ion exchange.

Preferably, a desired compound of the present invention is "extracted,"
"isolated" or "purified" such that the resulting preparation is substantially free of other
media components (e.g., free of media components and/or fermentation byproducts).
The language "substantially free of other media components" includes preparations of
the desired compound in which the compound is separated from media components or
fermentation byproducts of the culture from which it is produced. In one embodiment,
the preparation has greater than about 80% (by dry weight) of the desired compound
(e.g., less than about 20% of other media components or fermentation byproducts), more
preferably greater than about 90% of the desired compound (e.g., less than about 10% of
other media components or fermentation byproducts), still more preferably greater than
about 95% of the desired compound (e.g., less than about 5% of other media
components or fermentation byproducts), and most preferably greater than about 98-99%
desired compound (e.g., less than about 1-2% other media components or
fermentation byproducts). When the desired compound has been derivatized to a salt,
the compound is preferably further free of chemical contaminants associated with the
formation of the salt. When the desired compound has been derivatized to an alcohol,
the compound is preferably further free of chemical contaminants associated with the
formation of the alcohol.
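
The dry-weight purity tiers above amount to a simple threshold classification. A short sketch follows; the function names and tier labels are hypothetical, "about" is treated as an exact cut-off, and the "98-99%" tier is assumed to begin at 98%.

```python
# Thresholds (percent by dry weight) quoted from the text; labels are illustrative.
PURITY_TIERS = [
    (98.0, "most preferred"),
    (95.0, "still more preferred"),
    (90.0, "more preferred"),
    (80.0, "one embodiment"),
]


def purity_percent(compound_dry_g, total_dry_g):
    """Percent of the preparation's dry weight that is the desired compound."""
    if total_dry_g <= 0:
        raise ValueError("total dry weight must be positive")
    return 100.0 * compound_dry_g / total_dry_g


def purity_tier(percent):
    """Return the highest tier whose threshold is exceeded, or None below 80%."""
    for threshold, label in PURITY_TIERS:
        if percent > threshold:
            return label
    return None


if __name__ == "__main__":
    p = purity_percent(92.0, 100.0)
    print(p, purity_tier(p))
```

A preparation of 92 g desired compound per 100 g dry weight would fall in the "greater than about 90%" tier, leaving about 8% other media components or fermentation byproducts.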

In an alternative embodiment, the desired compound is not purified from
the microorganism, for example, when the microorganism is biologically non-hazardous
(e.g., safe). For example, the entire culture (or culture supernatant) can be used as a
source of product (e.g., crude product). In one embodiment, the culture (or culture
supernatant) is used without modification. In another embodiment, the culture (or
culture supernatant) is concentrated. In yet another embodiment, the culture (or culture
supernatant) is dried or lyophilized.

Preferably, a production method of the present invention results in
production of the desired compound, e.g., a B6 vitamer, at a significantly high yield.
The phrase "significantly high yield" includes a level of production or yield which is
sufficiently elevated or above what is usual for comparable production methods, for
example, which is elevated to a level sufficient for commercial production of the desired
product (e.g., production of the product at a commercially feasible cost). In one
embodiment, the invention features a production method that includes culturing a
recombinant microorganism under conditions such that the desired product (e.g., a B6
vitamer) is produced at a level greater than 5 mg/L. In another embodiment, the
invention features a production method that includes culturing a recombinant
microorganism under conditions such that the desired product (e.g., a B6 vitamer) is
produced at a level greater than 10 mg/L. In another embodiment, the invention features
a production method that includes culturing a recombinant microorganism under
conditions such that the desired product (e.g., a B6 vitamer) is produced at a level
greater than 50 mg/L. In yet another embodiment, the invention features a production
method that includes culturing a recombinant microorganism under conditions such that
the desired product (e.g., a B6 vitamer) is produced at a level greater than 150 mg/L.
Depending on the biosynthetic enzyme or combination of biosynthetic
enzymes manipulated, it may be desirable or necessary to provide (e.g., feed)
microorganisms of the present invention at least one biosynthetic precursor such that the
desired compound or compounds are produced. The term "biosynthetic precursor" or
"precursor" includes an agent or compound which, when provided to, brought into
contact with, or included in the culture medium of a microorganism, serves to enhance
or increase biosynthesis of the desired product. In one embodiment, the biosynthetic
precursor or precursor is glutamine. In another embodiment, the biosynthetic precursor
or precursor is ribose. The amount of glutamine or ribose added is preferably an amount
that results in a concentration in the culture medium sufficient to enhance productivity
of the microorganism (e.g., a concentration sufficient to enhance production of a B6
vitamer). The term "excess ribose or glutamine" includes ribose or glutamine levels
increased or higher than those routinely utilized for culturing the microorganism in
question. For example, culturing the Bacillus microorganisms described in the instant
Examples can be done in the presence of about 0-5 g/L ribose or glutamine.
Accordingly, excess ribose or glutamine levels can include levels of about 5-10 g/L or
more preferably about 5-20 g/L ribose or glutamine. Biosynthetic precursors of the
present invention can be added in the form of a concentrated solution or suspension
(e.g., in a suitable solvent such as water or buffer) or in the form of a solid (e.g., in the
form of a powder). Moreover, biosynthetic precursors of the present invention can be
added as a single aliquot, continuously or intermittently over a given period of time.
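
As a worked illustration of dosing a precursor to one of the concentrations mentioned above, the following sketch computes how much concentrated stock solution, or how much solid, would be added as a single aliquot. The function names, the 210 g/L stock concentration and the 40 ml culture volume are hypothetical choices for the example and are not taken from the text.

```python
def stock_volume_ml(target_g_per_l, culture_volume_ml, stock_g_per_l):
    """Volume of concentrated stock to add so the final mixture reaches the target.

    Solves target = stock * v / (culture + v) for v, i.e. it accounts for the
    dilution caused by the added volume itself.
    """
    if stock_g_per_l <= target_g_per_l:
        raise ValueError("stock must be more concentrated than the target")
    return target_g_per_l * culture_volume_ml / (stock_g_per_l - target_g_per_l)


def solid_mass_g(target_g_per_l, culture_volume_ml):
    """Mass of solid precursor for the target, neglecting any volume change."""
    return target_g_per_l * culture_volume_ml / 1000.0


if __name__ == "__main__":
    # 10 g/L ribose in a 40 ml culture from a hypothetical 210 g/L stock.
    print(stock_volume_ml(10.0, 40.0, 210.0))
    # 20 g/L added as powder to the same culture.
    print(solid_mass_g(20.0, 40.0))
```

For continuous or intermittent addition, the same total amount would simply be divided over the feeding period.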
Methods for culturing other recombinant organisms, e.g., the
recombinant plant organisms of the present invention, are known to one of skill in the
art. For example, conditions for culturing and maintaining recombinant plant cells are
described in Choi J.W., et al. (2001) Adv Biochem Eng Biotechnol 72:63-102; James E.,
Lee J.M. (2001) Adv Biochem Eng Biotechnol 72:127-56; Fischer R. and Emans N.
(2000) Transgenic Res. 9(4-5):279-99; Fischer R., et al. (2000) J Biol Regul Homeost
Agents Apr-Jun;14(2):83-92; Kutchan T.M. (1996) Gene Nov 7;179(1):73-81;
Creissen G., et al. (1996) Biochem Soc Trans. May;24(2):465-9; McKenna T.S., et al.
(1996) J Biotechnol Jan 26;44(1-3):83-9; Ma J.K. and Hein M.B. (1995) Plant Physiol
Oct;109(2):341-6; Taticek R.A., Lee C.W., Shuler M.L. (1994) Curr Opin Biotechnol
Feb;5(2):165-74; Arathoon W.R., Birch J.R. (1986) Science Jun 13;232(4756):1390-5;
Murray K. (1980) Philos Trans R Soc Lond B Biol Sci Aug 11;290(1040):369-86.
Furthermore, methods for culturing and maintaining the recombinant yeast organisms of
the invention are also known to one of skill in the art, as described in, for example,
Kjeldsen T. (2000) Appl Microbiol Biotechnol Sep;54(3):277-86; Fischer R., et al.
(1999) Biotechnol Appl Biochem Oct;30 (Pt 2):117-20; Mendoza-Vega O., Sabatie J.,
Brown S.W. (1994) FEMS Microbiol Rev Dec;15(4):369-410; Moir D.T., Mao J.I.
(1990) Bioprocess Technol 9:67-94.

This invention is further illustrated by the following examples which
should not be construed as limiting. The contents of all references, patents and
published patent applications cited throughout this application are incorporated herein
by reference.



EXAMPLES 



EXAMPLE 1: Biological assay for B6 vitamers using Saccharomyces uvarum.

Quantitation of B6 vitamers in supernatants of cultures of microorganisms
or extracts of organisms that have been genetically modified to increase
production of B6 vitamers is conveniently done using Saccharomyces uvarum (formerly
and still often named S. carlsbergensis) strain ATCC 9080 as the indicator strain or test
organism. The method is essentially that described in the Difco Manual (1984, Difco
Laboratories, Detroit, MI, 10th Edition, pp. 1104-1106), with the modification that 50
mg/liter of streptomycin sulfate is added to the liquid growth medium for the test
organism. However, any other appropriate indicator organism may be used, together
with a medium that is appropriate for that organism and that is free of B6 vitamers. For
example, an E. coli pdxB mutant can be used in a standard minimal medium that is well
known in the art, such as M9 glucose minimal medium (Miller, J., (1972) Experiments
in Molecular Genetics, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY).

When using S. uvarum strain ATCC 9080 as the indicator strain, Bacto
Pyridoxine Y Medium (Difco Laboratories, available through VWR Scientific, Inc.),
supplemented with 50 mg/liter streptomycin sulfate, is used for the serial dilutions, and
PN, PL, or PM is used to generate the standard curve. The responses to these three
standard compounds are almost identical to each other with S. uvarum strain ATCC
9080 (Figure 3).

EXAMPLE 2: Deletion of a portion of the yaaDE operon in B. subtilis. 

The SOR and SNO genes of Cercospora nicotianae were originally
identified by mutations that lead to hypersensitivity to singlet oxygen-generating
reagents (Ehrenschaft, M., et al. (1999) Proc. Natl. Acad. Sci. USA 96: 9347-9378).
Mutations in either of these genes also lead to PL auxotrophy. The protein sequences
obtained from translation of the SOR and SNO open reading frames were used as
homology probes to search through the B. subtilis genome sequence using the BLAST
homology search program of the Subtilist website. The SOR protein was significantly
homologous to the YaaD protein, and the SNO protein was significantly homologous to
the YaaE protein. Moreover, the genes encoding the YaaD and YaaE proteins (namely
yaaD and yaaE) occur adjacent to each other on the B. subtilis chromosome as a two
gene operon.

General methods for growth, storage, transformation, and molecular
biology of B. subtilis strains are given in Harwood, C., and Cutting, S. (1990), Molecular
Biological Methods for Bacillus, John Wiley and Sons, New York, NY, hereby
incorporated in its entirety by reference. The yaaDE operon DNA sequence was
amplified using the Polymerase Chain Reaction (PCR) with Pfu Turbo DNA polymerase
(Stratagene, Inc., used according to the manufacturer's instructions). The DNA primers
used were RY395 (SEQ ID NO:1) and RY396 (SEQ ID NO:2). RY395, the upstream
primer, introduces an XbaI site and artificial ribosome binding site. RY396, the
downstream primer, introduces a BamHI site. The template DNA was chromosomal
DNA isolated from wild type B. subtilis strain PY79. The blunt ended PCR product was
ligated into the EcoRV site of pGEM5Zf(+) (Promega, Inc.) to give plasmid pAN368.
Next, a gram positive chloramphenicol resistance gene on a blunt DNA fragment was
ligated into pAN368 that had been cut with HpaI, to give plasmid pDX1F (SEQ ID
NO:5, Figure 4). pDX1F therefore is deleted for a portion of yaaD and a portion of
yaaE. pDX1F was used to transform wild type B. subtilis strain PY79 to 5 mg/liter
chloramphenicol resistance, and a double crossover event was confirmed using PCR and
the same primers used to clone yaaDE. The resulting strain was named PX1.

PX1 was able to grow on Spizizen's minimal medium with trace
elements (SMM) (Harwood, C., and Cutting, S. (1990) Molecular Biological Methods
for Bacillus, John Wiley and Sons, New York, NY, pp. 548-549) supplemented with 2
µM pyridoxal HCl (Sigma-Aldrich Chemical Co.), but it did not grow without the
supplement. Thus it was established that at least one of yaaD or yaaE is required for
PLP synthesis in B. subtilis.



EXAMPLE 3: Deletion of yhaF in B. subtilis. 

The protein sequence of the E. coli pdxF gene was used as a probe to
search the B. subtilis genome as described in Example 2. The only significant homolog
was yhaF. In a fashion similar to that of Example 2, yhaF was cloned and deleted
from the chromosome of PY79 using plasmid pDX11F (SEQ ID NO:6, Figure 5), to
give strain PX11. The PCR primers used to clone yhaF were RY407 (SEQ ID NO:3)
and RY408 (SEQ ID NO:4). The restriction sites used for insertion of the antibiotic
resistance gene were the PshAI and EheI sites in the yhaF coding region. PX11 is a
serine auxotroph, but not a PL auxotroph. By comparison to E. coli, it appears that yhaF
functions in serine synthesis and probably encodes the equivalent of SerC, but that the
YhaF protein is not required for PLP synthesis in B. subtilis. Therefore, it is established
that sequence homology alone does not necessarily imply functional homology.

EXAMPLE 4: Overexpression of the yaaDE operon in B. subtilis. 

The XbaI to BamHI fragment from pAN368 that contains the yaaDE
operon and artificial ribosome binding site was inserted into either of two expression
vectors, to yield plasmids pDX14R (SEQ ID NO:7) and pDX17R (SEQ ID NO:8),
respectively. In pDX14R and pDX17R the yaaDE operon is expressed from the strong
constitutive B. subtilis phage SPO1 promoters, P26 and P15, respectively (see Figures 6
and 7).

pDX14R and pDX17R were each transformed into wild type B. subtilis
strain PY79, selecting for chloramphenicol resistance. The plasmids integrate into the
chromosome at the yaaDE locus by single crossover. The resulting strains were named
PX14 and PX17, respectively.

PX14 and PX17 were grown for 48 hours at 37°C in 5 ml test tube
cultures in a roller drum at about 100 rotations per minute. The culture medium was
SVY (20 g Difco Veal Infusion Broth, 5 g Difco Yeast Extract, 2 g ammonium sulfate, 5
g sodium glutamate, and 30 g glucose per liter, buffered with 200 mM potassium
phosphate, pH 7.0). Cells were removed by centrifugation followed by sterile filtration
(Millipore 0.45 micron), and the supernatant solutions were assayed for PL equivalents
using the biological assay described in Example 1. The parent strain, PY79, was grown
and processed in similar fashion as a control. The uncultured SVY medium was assayed
as another control, since it was likely that the SVY medium contained a measurable
level of B6 vitamers. The results are shown in Table 2, below.



Table 2: Production of B6 vitamers by Bacillus subtilis derivatives in 48 hour test tube
cultures grown in SVY

Strain      Cassette     Integration   OD600   Total B6 Vitamers 1   Net B6 Vitamers 2
                         Target                (mg/liter)            (mg/liter)

PX14        P26 yaaDE    yaaDE         17      7.2                   7.0
PX17        P15 yaaDE    yaaDE         17      4.9                   4.7
PX1         ΔyaaDE       yaaDE         8       0.4                   0.2
PY79                                   19      0.8                   0.6
(Medium)                               0.08    0.2                   (0)

1 Sum of PN, PL, PM, and derivatives thereof that can be utilized by pyridoxine indicator
strain S. carlsbergensis as a source of vitamin B6 for growth.

2 Calculated by subtracting the amount assayed in the medium.



After subtracting the B6 vitamers contained in the medium, strain PX14
produced 7.0 mg/liter PL equivalents, while the parent PY79 produced only 0.6 mg/liter
of PL equivalents. Thus, expression of the yaaDE operon has been shown to be rate
limiting for B6 vitamer production in B. subtilis. Moreover, a genetically modified
strain, where this rate limiting step was enhanced, produced more than a ten-fold
increase in B6 vitamer secretion compared to that of the parent.
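
The net values and the fold increase stated above follow from the Table 2 totals by subtracting the medium blank. The short sketch below reproduces that arithmetic; the variable names are illustrative, and the numbers are the Table 2 values.

```python
# Total B6 vitamers (mg/liter PL equivalents) from Table 2.
MEDIUM_BLANK = 0.2
TOTAL_B6 = {"PX14": 7.2, "PX17": 4.9, "PX1": 0.4, "PY79": 0.8}

# Net production = total assayed minus the amount assayed in uncultured medium.
NET_B6 = {strain: round(total - MEDIUM_BLANK, 1) for strain, total in TOTAL_B6.items()}

# Fold increase of the overexpressing strain over its parent.
FOLD_PX14_OVER_PY79 = NET_B6["PX14"] / NET_B6["PY79"]

if __name__ == "__main__":
    for strain, net in NET_B6.items():
        print(f"{strain}: {net} mg/liter net")
    print(f"PX14 / PY79 = {FOLD_PX14_OVER_PY79:.1f}-fold")
```

The ratio of 7.0 to 0.6 is roughly 11.7, consistent with the statement of a more than ten-fold increase.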



EXAMPLE 5: Shake flask experiments to determine B6 vitamer production
from B. subtilis strain PX14 containing an amplifiable P26
yaaDE cassette.

To further test B6 production, PX14 and PY79 were grown in
parallel in shake flasks in soy flour, maltose, MOPS medium (SMM). The 200
ml baffled shake flasks contained 40 ml of medium, were covered with 4X
Bioshield covers, and shaken at 37°C and 300 rpm. Samples were taken at 24
and 48 hours and analyzed for vitamer concentration by HPLC (Table 3).

Titers produced by the parental control strain PY79 were 3 mg/L. PX14
produced a B6 titer of 45 mg/L when grown in shake flasks containing SMM. In shake
flasks, where the overall vitamer concentration increased with time, the amount of PN
produced increased and exceeded that of PMP at 48 hours (Table 3).

B. subtilis has a fairly broad temperature range of growth up to
approximately 50°C. A significant increase of B6 titer was seen with B. subtilis strain
PX14 when grown in shake flasks with SMM at 43°C as compared to 37°C (Table 4).
The B6 titer increased more than 25% at 43°C as compared to that at 37°C.

TABLE 3. Vitamer composition of PX14 and PY79 grown in shake flasks in SMM 3.

Strain   Time point (hr.)   PMP 1   PM 1   PL 1   PN 1   Total Vitamer 2

PX14     24                 10.8    2.7    0.0    6.4    20
         48                 15.5    8.5    1.7    19.0   45
PY79     24                 1.0     0.0    0.0    0.0    1
         48                 2.9     0.4    0.0    0.0    3

1 PMP, PM, PL and PN are reported in mg/L and quantified by HPLC.

2 Total vitamer is equal to the sum of PMP, PM, PL and PN and reported in mg/L.

3 SMM contains in a final volume of 200 ml: 4.0 g soy flour, 1.6 g (NH4)2SO4, 1.0 g monosodium
glutamate, 2.0 ml 100X PSTE trace minerals, 5.0 ml 4 M potassium phosphate buffer pH 7.2, 40 ml
1.5 M MOPS buffer pH 7.2, and deionized water. Added after autoclaving as a 20X concentrate to
give a final concentration of 3% maltose with 5 mM [illegible] chloride and 0.7 mM calcium
chloride. 100X PSTE contains 0.2 g/L MnCl2·H2O, 0.15 g/L ZnSO4·7H2O, 0.2 g/L CoCl2·6H2O,
0.025 g/L CuSO4·5H2O and 0.075 g/L Na2MoO4·2H2O.
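
As a consistency check on Table 3, the reported total vitamer values can be compared with the sum of the four individual vitamers. The sketch below does this; the data structure and names are illustrative, the numbers are those of Table 3, and rounding to the nearest whole mg/L is an assumption about how the totals were reported.

```python
# (strain, hours) -> (PMP, PM, PL, PN, reported total), all in mg/L, from Table 3.
TABLE_3 = {
    ("PX14", 24): (10.8, 2.7, 0.0, 6.4, 20),
    ("PX14", 48): (15.5, 8.5, 1.7, 19.0, 45),
    ("PY79", 24): (1.0, 0.0, 0.0, 0.0, 1),
    ("PY79", 48): (2.9, 0.4, 0.0, 0.0, 3),
}


def summed_total(pmp, pm, pl, pn):
    """Total vitamer as the sum of PMP, PM, PL and PN (mg/L)."""
    return pmp + pm + pl + pn


def totals_consistent(table):
    """True if every reported total equals the rounded sum of its components."""
    return all(
        round(summed_total(pmp, pm, pl, pn)) == reported
        for (pmp, pm, pl, pn, reported) in table.values()
    )


if __name__ == "__main__":
    for (strain, hours), (pmp, pm, pl, pn, reported) in TABLE_3.items():
        print(strain, hours, round(summed_total(pmp, pm, pl, pn), 1), reported)
    print(totals_consistent(TABLE_3))
```

The same rows also show PN (19.0 mg/L) exceeding PMP (15.5 mg/L) for PX14 at 48 hours, as noted in the text.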

TABLE 4. Vitamer composition of PX14 grown in shake flasks in SMM for 48
hours.

Strain   Temperature (°C)   PMP 1   PM 1   PL 1   PN 1   Total Vitamer 2

PX14     37                 15      9      2      19     45
         43                 11      13     1      36     61

1 PMP, PM, PL and PN are reported in mg/L and quantified by HPLC.

2 Total vitamer is equal to the sum of PMP, PM, PL and PN and reported in mg/L.
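
The temperature effect reported in Table 4 can be expressed as a percent increase in total vitamer titer. A brief sketch of the calculation follows; the names are illustrative and the titers are the Table 4 totals.

```python
# Total vitamer (mg/L) for PX14 after 48 hours, from Table 4.
TITER_37C = 45
TITER_43C = 61
PN_43C = 36


def percent_increase(baseline, value):
    """Relative increase of `value` over `baseline`, in percent."""
    return 100.0 * (value - baseline) / baseline


INCREASE_PCT = percent_increase(TITER_37C, TITER_43C)
PN_SHARE_43C_PCT = 100.0 * PN_43C / TITER_43C

if __name__ == "__main__":
    print(f"Titer increase at 43°C: {INCREASE_PCT:.1f}%")
    print(f"PN share of total at 43°C: {PN_SHARE_43C_PCT:.0f}%")
```

The increase works out to about 36%, in line with the statement that the titer rose by more than 25%, and most of the gain is in PN.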

EXAMPLE 6: Complementation of E. coli pdx mutants by plasmids that express
the B. subtilis yaaDE operon.

Plasmid pDX14R, described above in Example 4, was used to transform
various E. coli strains, each carrying a loss-of-function mutation in one of the known
genes involved in PLP biosynthesis (except for dxs, which is an essential gene for E.
coli). The selection was for resistance to 250 mg/liter ampicillin. Each transformant
was then tested for growth on minimal medium (SMM with 0.5% glucose, see Example
2) supplemented with 100 mg/liter serine, and compared to growth of its respective
untransformed parent on the same medium. All mutations tested were complemented by
pDX14R. Specifically, pdxA, pdxB, pdxF, pdxJ, and pdxH were all complemented by
pDX14R. Therefore, expression of the B. subtilis yaaDE operon in E. coli is sufficient
for PLP biosynthesis, even in the absence of any one of the above functional pdx genes.
Several important and unexpected conclusions or inferences can be drawn from these
results. First, the substrate(s) for the enzyme(s) encoded by yaaDE must be present in
E. coli, even when a biosynthetic intermediate normally used to make PLP is absent or
greatly reduced. Second, PNP or PLP is possibly the product of the enzyme(s) encoded
by yaaDE. Third, since an early block in the E. coli PLP biosynthetic pathway (for
example that in a pdxB mutant) does not prevent yaaDE from complementation, the
substrates for the enzyme(s) encoded by yaaDE are not likely to be the same as for the
last step in PMP or PLP synthesis in wild type E. coli. These unexpected results lead to
the possibility of producing B6 vitamers using B. subtilis yaaDE or the homologous
genes from another organism (for example, but not limited to, SOR and SNO from
Cercospora nicotianae or PDX1 and PDX2 from S. cerevisiae) in a heterologous host
species, including, but not limited to, E. coli and Oryza sativa.

EXAMPLE 7: Overexpression of the yaaDE operon in E. coli. 

Plasmids pDX14R and pDX17R were transformed into E. coli strain
DH5α (New England Biolabs), selecting for ampicillin resistance. The transformants
were grown for 48 hours in 5 ml test tube cultures at 37°C, and the supernatants were
worked up as in Example 4. The assay results for PL equivalents are shown in Table 5,
below.

Table 5: Production of B6 vitamers by Escherichia coli harboring plasmids

Strain   Plasmid      OD600   Total B6 Vitamers 2   Net B6 Vitamers 3
         Cassette             (mg/liter)            (mg/liter)

DH5α     P26 yaaDE    7.6     3.2                   3.1
DH5α     P15 yaaDE    8.2     3.2                   3.1
DH5α                  9       0.1                   (0)

2 Sum of PN, PL, PM, and derivatives thereof that can be utilized by pyridoxine indicator
strain S. carlsbergensis as a source of vitamin B6 for growth.

3 Calculated by subtracting the amount assayed in DH5α not containing plasmid.

Thus it has been shown that the yaaD and yaaE genes can be expressed
in a heterologous host strain, and B6 vitamers can still be overproduced. By extension
of this approach, the yaaD and yaaE genes of B. subtilis can be overexpressed in any
organism where an overexpression system exists, and in the resulting strains, B6
vitamers will be overproduced. Overproduction of the rate limiting enzyme for B6
vitamer production in any organism that is capable of producing B6 vitamers will lead to
overproduction of B6 vitamers.



EXAMPLE 8: Test tube experiment to determine B6 vitamer production
from E. coli strain DH5α(pDX14) grown on SVY
supplemented with pentose or hexose carbon sources.

In order to test whether feeding pentoses could enhance B6 synthesis, E.
coli strain DH5α harboring a plasmid containing the P26 yaaDE cassette was grown in
a variety of SVY based media containing a pentose (xylose or ribose) or hexose (glucose
or maltose) as the carbohydrate source. The strains were also grown on SVY medium
containing no carbohydrate source as a control. Production of B6 vitamers from E. coli
strain DH5α(pDX14) grown in SVY ribose is greater than that from E. coli strain
DH5α(pDX14) grown in SVY xylose, glucose, arabinose or maltose. More than 60
mg/L of total B6 vitamers have been produced in the DH5α(pDX14) cultures grown in
SVY ribose test tube with PN as the major product (Table 6).

TABLE 6. Production of B6 vitamers by E. coli harboring a plasmid containing
engineered B. subtilis genes and grown in the presence or absence of hexose or
pentose carbon sources.

Strain         Carbon source   OD600   Total Vitamer 1 (mg/L)

DH5α(pDX14)    Glucose         9       8
               Ribose          16      61
               Xylose          8       31
               Arabinose       7       21
               None            7       25

DH5α           Glucose         9       4
               Ribose          18      5
               Xylose          9       4
               Arabinose       9       1
               None            7       3

1 Sum of PMP, PM, PL, and PN.



EXAMPLE 9: Vitamin B6 titers of a pdxB E. coli strain producing YaaD in
the absence of YaaE

In order to determine the titers of B6 produced from YaaD alone, in the
absence of YaaE, E. coli strain WG1012 (pdxB) was transformed with pDX14 and
pDX24 (plasmid isolates N2, D2 and D6), and control plasmid pBR322. These
transformed strains were grown in 5 mL test tube cultures in SVY ribose in roller drums
at 37°C for 48 hours. Ribose was chosen as the carbon source, since past experiments
showed that ribose improved B6 vitamer production in E. coli transformants. B6 titers of
culture supernatants were determined by HPLC assay. In all cases, the plasmids that
complemented the pdxB strain led to measurable production of B6, whereas the
plasmids (i.e., pBR322) that did not complement did not lead to any measurable B6
(Tables 7 and 8). Several isolates of pDX24, which contains the yaaD ORF, produced
approximately 7-10 mg/L of the B6 vitamers. However, E. coli strain WG1012 (pdxB)
containing pDX24 produced only a fraction of the amount of B6 as compared to the E.
coli strain WG1012 harboring pDX14 containing the P26 yaaDE cassette (Table 8).

Apparently, YaaD is functional by itself and can catalyze the synthesis
of vitamers similar in composition to those synthesized by YaaDE, albeit less
efficiently. It appears that while YaaE is not absolutely essential for B6 synthesis, it
provides a supportive role to YaaD, thereby leading to higher B6 titers. YaaD is also able
to function independently of YaaE in B. subtilis; however, it does so less efficiently than
when YaaE is present. Expressing a copy of P26 yaaD in either PX1 or PY79 resulted in
higher titers of vitamers as compared to the respective parental controls.



TABLE 7. YaaD complements a pdxB E. coli auxotroph.

Strain and plasmid   Cassette     PDC free   PDC free + 2 µM PL

E. coli pdxB-
  pBR322             None         -          +
  pDX14              P26 yaaDE    +          +
  pDX24              P26 yaaD     +          +



TABLE 8. Vitamer profile of E. coli pdxK strains harboring a plasmid containing a

Plasmid      Isolate   Cassette    OD600   PM     PL     PN     PM+PL+PN
                                                                (mg/L)

pDX14                  P26yaaDE    14.1    0.6    0.0    28.8   29.4
pDX24        N2        P26yaaD     12.2    0.0    2.0     7.4    9.4
pDX24        D2        P26yaaD     15.5    0.0    0.0     6.9    6.9
pDX24        D6        P26yaaD     13.6    0.0    0.0     9.7    9.7
pBR322                             12.4    0.0    0.0     0.0    0.0
SVY-ribose                                 0.0    0.0     0.0    0.0

hours. Vitamers were quantified by HPLC.
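The comparison drawn from Table 8 can be checked with a short calculation. The following Python sketch is illustrative only: it transcribes the PM, PL and PN values from Table 8, recomputes the summed titers, and estimates how much more B6 the yaaDE cassette (pDX14) yielded than the yaaD-only cassette (pDX24). The variable and function names are assumptions made for this sketch, and averaging the three pDX24 isolates is a choice made here, not a step described in the text.

```python
# Values transcribed from Table 8, in mg/L: (PM, PL, PN) per plasmid and isolate.
table8 = {
    ("pDX14", ""): (0.6, 0.0, 28.8),
    ("pDX24", "N2"): (0.0, 2.0, 7.4),
    ("pDX24", "D2"): (0.0, 0.0, 6.9),
    ("pDX24", "D6"): (0.0, 0.0, 9.7),
}


def total_b6(vitamers):
    """Sum PM + PL + PN and round to the one decimal place used in the table."""
    return round(sum(vitamers), 1)


totals = {key: total_b6(values) for key, values in table8.items()}

# Titer from the yaaDE cassette versus the mean titer of the yaaD-only isolates.
yaade_total = totals[("pDX14", "")]
yaad_totals = [t for (plasmid, _), t in totals.items() if plasmid == "pDX24"]
yaad_mean = sum(yaad_totals) / len(yaad_totals)
fold_difference = yaade_total / yaad_mean

print(totals)
print(f"yaaDE / mean yaaD titer: {fold_difference:.1f}-fold")
```

On these figures the yaaDE cassette gives a little over three times the mean titer of the yaaD-only isolates, which is consistent with the statement that YaaE plays a supportive rather than an essential role.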



The YaaD and YaaE protein sequences were used as probes to search the
NCBI database for homologs in plants using the BLAST™ program, which can be found
at the National Center for Biotechnology Information website (Altschul, S.F., et al. (1990)
J. Mol. Biol. 215(3):403-10). Homologs of YaaD were found in several genera
of plants, including Arabidopsis, Oryza, Ginkgo, Hevea, Phaseolus, and Stellaria. Two
homologs of YaaE were found in Arabidopsis thaliana. However, no homologs of pdxA
and pdxJ were found. Therefore, the plant kingdom appears to use the Type B Pathway
for B6 vitamer biosynthesis. Thus, for example, overexpression of the YaaD homolog
(GenBank accession number AAL73561) from Oryza sativa (rice), and the A. thaliana
homolog of YaaE (GenBank accession number AB011483) together in a plant using
methods well known in the art, such as expression from the Cauliflower Mosaic Virus
35S promoter, will lead to overproduction of B6 vitamers in that plant.
EXAMPLE 10: Other routes to increasing the activity of enzymes involved in
B6 vitamer synthesis.

The overexpression of the yaaDE operon leads to an increase in the
amount of the encoded enzyme(s), which in turn leads to an increase in the total activity
of said enzyme(s). This increase in activity leads to an increase in the production of B6
vitamer. Other methods can be used to increase the activity of the relevant enzyme(s)
under conditions of B6 vitamer production. For example, in addition to increasing the
amount of a relevant enzyme(s), the total activity of the relevant enzyme(s) can be
increased by mutating the gene(s) to increase the specific activity of the enzyme(s),
and/or by mutating the gene(s) to encode a feedback resistant variant of the enzyme(s).
Such desirable mutations can be obtained by screening large numbers of mutants for the
increased activity as evidenced by an increase in B6 vitamer production as described in
Example 4, or by selecting for mutants that are resistant to inhibitors that are specific for
the PLP biosynthetic pathway, and screening among those mutants for an increase in B6
vitamer production. Examples of such inhibitors are isoniazid, iproniazid, and
ginkgotoxin (4'-methoxypyridoxine) (Dempsey and Arcement (1971) J. Bacteriol.
107(2): 580-582; Pflug, W., and Lingens, F., (1978) Hoppe-Seyler's Z. Physiol. Chem.
359: 559-570; Fiehe, K., et al., (2000) J. Nat. Prod. 63(2): 185-189).

EXAMPLE 11: Processing of biosynthetic B6 vitamers.

A B6 vitamer produced by a genetically modified organism of the
invention can be harvested and processed into a format that is appropriate for
commercial use. For example, after culturing a B6 vitamer producing micro-organism in
liquid culture, the entire culture, including cells, can be dried by evaporation or by spray
drying, and the resulting powder can be mixed into animal feeds. Alternatively, the cells
can be first removed by centrifugation or filtration, and the resulting supernatant
solution can be dried as described above. As another alternative, the B6 vitamer can be
purified from the culture broth by techniques well known in the art, such as filtration,
reverse osmosis, column chromatography (ion exchange, hydrophobic or hydrophilic
adsorption, gel filtration, etc.), solvent extraction, precipitation, distillation, evaporation,
and the like. If the B6 vitamer producing organism is a plant, then the appropriate
portion of the plant (for example the leaves, stems, roots, flowers, fruits, seeds, or any
combination thereof) can be harvested and processed. For example, the plant material
can be dried and used directly, or the material can be pulverized or ground and the B6
vitamer extracted and/or processed as described above for cultures.

The production organism can be a micro-organism that normally inhabits
the gut of humans or an animal of interest (for example one of many bacteria of the
genus Lactobacillus, such as L. acidophilus), and the B6 vitamer can be delivered by
ingestion of the organism.

EXAMPLE 12: Genes homologous to yaaDE expressed in host organisms for
the production of B6.

Preferred methods for the production of vitamin B6 involve expression of
Bacillus subtilis yaaD and/or yaaE or expression of genes that are homologous to
Bacillus subtilis yaaD and/or yaaE in host organisms (type A or type B organisms) such
as E. coli, S. cerevisiae, Arabidopsis thaliana, etc. In order to identify genes or proteins
that are homologous to yaaD and yaaE, a procedure such as a protein or nucleotide
BLAST™ (Basic Local Alignment Search Tool) search can be implemented (Altschul,
Stephen F., et al. (1997) Nucleic Acids Res. 25:3389-3402). The BLAST™ search tool
can be found and used at the National Center for Biotechnology Information
website. BLAST™ is an algorithm designed to compare sequences and provide
sequence alignments between two or more previously characterized sequences, and it also
provides a method for rapid searching of nucleotide and protein databases. A typical
description line of a BLAST output table is given as
"gi|586857|sp|P37527|YAAD_BACSU 31.5 kDa guanylylated protei... 506 e-142"
where "gi" is the GenBank identifier of the subject, which in this case is number
"586857", "sp" represents the databank from which the subject was retrieved, "P37527"
represents the accession number, "YAAD_BACSU 31.5 kDa
guanylylated protei..." is a brief description of the sequence, the number "506"
represents the bit score of the highest-scoring HSP (high-scoring segment pair), and "e-142" is
the E value that represents the statistical significance of a given alignment. The E value
also reflects the size of the database and the scoring value used. The lower the E value,
the more significant the homology. The description lines in the BLAST™ output table
(Table 9, below) represent a typical BLAST™ query of the primary amino acid
sequence of B. subtilis YaaD and are sorted by increasing E value, with the most
significant alignments (lowest E values) at the top. A similar BLAST™ query of the
primary amino acid sequence of B. subtilis YaaE is represented in Table 10, below.
These alignments represent a small portion of the protein sequences homologous to
YaaD and YaaE protein sequences from various organisms, as the sequences from many
organisms have not been, or have only partially been, entered into the current databases.
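The description-line layout explained above can be made concrete with a small parser. The Python sketch below is a minimal illustration, not part of any BLAST distribution: the function name `parse_description_line` and the regular expression are assumptions written for this example, and they cover only the legacy "gi|number|databank|accession|description score E-value" layout quoted in the text. An E value printed as "e-142" is read as 1e-142.

```python
import re

# Hypothetical pattern for one legacy BLAST description line, as quoted in the text.
LINE_RE = re.compile(
    r"^gi\|(?P<gi>\d+)\|(?P<db>[a-z]+)\|(?P<acc>[^|]*)\|(?P<desc>.*?)\s+"
    r"(?P<bits>\d+)\s+(?P<evalue>\d*e-\d+|\d+(?:\.\d+)?)$"
)


def parse_description_line(line):
    """Split a description line into identifier, databank, accession, text, score and E value."""
    match = LINE_RE.match(line.strip())
    if match is None:
        raise ValueError(f"unrecognized description line: {line!r}")
    evalue_text = match["evalue"]
    if evalue_text.startswith("e"):
        # Legacy output abbreviates 1e-142 as "e-142".
        evalue_text = "1" + evalue_text
    return {
        "gi": int(match["gi"]),
        "databank": match["db"],
        "accession": match["acc"],
        "description": match["desc"].strip(),
        "bit_score": int(match["bits"]),
        "e_value": float(evalue_text),
    }


hit = parse_description_line(
    "gi|586857|sp|P37527|YAAD_BACSU 31.5 kDa guanylylated protei... 506 e-142"
)
print(hit)
```

A list of parsed hits can then be ordered with `sorted(hits, key=lambda h: h["e_value"])`, which places the most significant alignments first.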
Table 9: Sequences producing significant alignments to Bacillus subtilis YaaD

                                                                  Score  E
                                                                  (bits) Value

gi|586857|sp|P37527|YAAD_BACSU 31.5 kDa guanylylated protei...  506  e-142
gi|21397980|ref|NP_653965.1| SOR_SNZ, SOR/SNZ family [Bacil...  453  e-126
gi|10172634|dbj|BAB03741.1| superoxide-inducible protein [B...  437  e-122
gi|15603097|ref|NP_246169.1| unknown [Pasteurella multocida...  435  e-121
gi|16414718|emb|CAC97434.1| similar to a protein required f...  431  e-120
gi|22778374|dbj|BAC14643.1| superoxide-inducible protein 7 [...  430  e-119
gi|16411571|emb|CAD00179.1| similar to a protein required f...  428  e-119
gi|25364956|pir||H89818 conserved hypothetical protein SA04...  424  e-118
gi|22090614|dbj|BAC06852.1| superoxide-inducible protein [B...  418  e-116
gi|23038401|gb|ZP_00070557.1| hypothetical protein [Oenococ...  392  e-108
gi|7482301|pir||F69188 ethylene-inducible protein - Methano...  360  1e-98
gi|22970419|gb|ZP_00017505.1| hypothetical protein [Chlorof...  359  3e-98
gi|6459121|gb|AAF10938.1|AE001982_12 singlet oxygen resista...  346  2e-94
gi|13310484|gb|AAK18310.1|AF344827_1 Sor-like protein [Gink...  342  5e-93
gi|23017679|gb|ZP_00057403.1| hypothetical protein [Thermob...  340  1e-92
gi|18481963|gb|AAL73561.1|AC079632_5 Putative ethylene-indu...  339  2e-92
gi|11358736|pir||T48163 pyridoxine biosynthesis protein-lik...  338  4e-92
gi|3123099|sp|O14027|YEM4_SCHPO Hypothetical protein C29B12...  338  7e-92
gi|11359355|pir||T46647 pyridoxine biosynthesis protein pyr...  338  8e-92
gi|2129913|pir||S60047 ethylene-responsive protein 1 - Para...  337  1e-91
gi|10719739|gb|AAG17942.1| putative pyridoxine biosynthetic ...  337  2e-91
gi|11362638|pir||T46646 pyridoxine biosynthesis protein pdx...  336  2e-91
gi|20094807|ref|NP_614654.1| Pyridoxine biosynthesis enzyme...  336  3e-91
gi|7462114|pir||A72372 conserved hypothetical protein - The...  335  4e-91
gi|21617922|gb|AAM66972.1| pyridoxine biosynthesis protein-...  335  5e-91
gi|2193896|emb|CAB09637.1| hypothetical protein MLCL581.12c...  333  1e-90
gi|20807305|ref|NP_622476.1| Pyridoxine biosynthesis enzyme,...  333  2e-90
gi|15827141|ref|NP_301404.1| putative pyridoxine biosynthes...  332  3e-90
gi|13878067|gb|AAK44111.1|AF370296_1 putative SOR1 from the...  332  3e-90
gi|21220023|ref|NP_625802.1| conserved hypothetical protein...  330  1e-89
gi|18076239|emb|CAC81977.1| err-related and stress induced ...  330  2e-89
gi|20090425|ref|NP_616500.1| pyridoxine biosynthesis protei...  328  4e-89
gi|6137073|emb|CAB59635.1| ethylene responsive receptor, ER...  328  5e-89
gi|23052101|gb|ZP_00078783.1| hypothetical protein [Methano...  327  9e-89
gi|3123042|sp|O06208|YQ06_MYCTU Hypothetical protein Rv2606...  327  2e-88
gi|15842146|ref|NP_337183.1| pyridoxine biosynthesis protei...  327  2e-88
gi|25364954|pir||A96973 probable phosphate-utilizing enzyme...  323  2e-87
gi|23335752|gb|ZP_00120985.1| hypothetical protein [Bifidob...  322  4e-87
gi|21323554|dbj|BAB98181.1| Pyridoxine biosynthesis enzyme...  322  5e-87
gi|19552014|ref|NP_600016.1| pyridoxine biosynthesis enzyme,...  322  5e-87
gi|23465712|ref|NP_696315.1| widely conserved protein in up...  320  1e-86
gi|21228534|ref|NP_634456.1| putative pyridoxine biosynthes...  318  5e-86
gi|2501580|sp|Q58090|Y677_METJA Hypothetical protein MJ0677...  317  1e-85
gi|23493620|dbj|BAC18589.1| conserved hypothetical protein...  314  1e-84
gi|13541829|ref|NP_111517.1| Predicted phosphate-utilizing ...  313  2e-84
gi|18893665|gb|AAL81653.1| ethylene-inducible protein homol...  312  4e-84
gi|14591161|ref|NP_143237.1| ethylene-responsive protein [P...  311  6e-84
gi|14521000|ref|NP_126475.1| ETHYLENE-RESPONSIVE PROTEIN [P...  311  7e-84
gi|15622519|dbj|BAB66510.1| 336aa long hypothetical stress-...  308  4e-83
gi|6015903|emb|CAB57730.1| hypothetical protein [Sulfolobus ...  307  1e-82
gi|10639689|emb|CAC11661.1| probable pyridoxine biosynthesi...  304  1e-81
gi|1078513|pir||S55082 hypothetical protein YMR096w - yeast...  303  2e-81
gi|14972957|gb|AAK75562.1| pyridoxine biosynthesis protein ...  302  5e-81
gi|7483295|pir||D69313 ethylene-inducible protein homolog -...  301  7e-81
gi|15458966|gb|AAL00126.1| Pyridoxine biosynthesis protein ...  301  9e-81
gi|19704795|ref|NP_604357.1| pyridoxine biosynthesis protei...  299  4e-80
gi|1074877|pir||F64173 hypothetical protein HI1647 - Haemop...  298  5e-80
gi|12802367|gb|AAK07850.1|AF309689_12 Snz-type pyridoxine v...  297  1e-79
gi|3172534|gb|AAC18606.1| hypothetical protein IP1 [Francis...  294  1e-78
gi|22405887|gb|ZP_00000750.1| hypothetical protein [Ferropl...  293  2e-78
gi|1730655|sp|P53824|SNZ2_YEAST SNZ2 PROTEIN >gi|6323996|re...  293  2e-78
gi|1084717|pir||S56196 hypothetical protein YFL059W - yeast...  292  3e-78
gi|25364947|pir||A84331 hypothetical protein Vng1793c [impo...  288  9e-77
gi|23479541|gb|EAA16340.1| ethylene-inducible protein never...  288  9e-77
gi|23612291|ref|NP_703871.1| pyridoxine biosynthetic enzyme...  287  1e-76
gi|14600562|ref|NP_147079.1| ethylene-responsive protein 1 ...  285  5e-76
gi|18313608|ref|NP_560275.1| stress induced protein, conse...  279  4e-74
gi|4103954|gb|AAD01898.1| A37 [Arabidopsis thaliana] >gi|41...  249  2e-65
gi|2501581|sp|Q50841|YTR5_METVA Hypothetical protein in tRN...  237  1e-61
gi|44737|emb|CAA25434.1| unidentified reading frame [Methan...  218  6e-56
gi|2501579|sp|Q41348|H47_STELP HYPOTHETICAL PROTEIN H47  207  1e-52
gi|7488891|pir||S71492 ethylene-responsive protein 2 - Para...  205  5e-52
gi|297176|emb|CAA50602.1| expressed ORF [Stellaria longipes...  196  4e-49
gi|21671884|gb|AAM74246.1|AC074355_8 Putative protein simil...  100  2e-20
gi|20521394|dbj|BAB91905.1| putative ethylene-responsive pro...  90  4e-17
gi|22138453|gb|AAM93437.1| putative ethylene-inducible prot...  84  3e-15
gi|3335369|gb|AAC27170.1| similar to SOR1 from the fungus C...  78  1e-13
gi|15217264|gb|AAK92608.1|AC078944_19 Hypothetical protein ...  57  2e-07
Table 10: Sequences producing significant alignments to Bacillus subtilis YaaE

                                                                  Score  E
                                                                  (bits) Value

gi|467402|dbj|BAA05248.1| unknown [Bacillus subtilis] >gi|2...  382  e-105
gi|21397981|ref|NP_653966.1| SNO, SNO glutamine amidotransf...  258  3e-68
gi|10172635|dbj|BAB03742.1| amidotransferase [Bacillus halo...  231  4e-60
gi|22090615|dbj|BAC06853.1| 2-deoxy-scyllo-inosose synthase...  221  4e-57
gi|23038400|gb|ZP_00070556.1| hypothetical protein [Oenococ...  216  1e-55
gi|14246288|dbj|BAB56682.1| conserved hypothetical protein ...  206  1e-52
gi|16414719|emb|CAC97435.1| lin2206 [Listeria innocua] >gi|...  190  1e-47
gi|15603098|ref|NP_246170.1| unknown [Pasteurella multocida...  190  1e-47
gi|22778373|dbj|BAC14642.1| amidotransferase [Oceanobacillu...  187  5e-47
gi|15643238|ref|NP_228282.1| amidotransferase, putative [Th...  187  6e-47
gi|16411572|emb|CAD00180.1| lmo2102 [Listeria monocytogenes...  184  6e-46
gi|21220022|ref|NP_625601.1| conserved hypothetical protein...  176  1e-43
gi|2145628|pir||S72721 amidotransferase hisH homolog - Myco...  176  2e-43
gi|7447522|pir||C70570 hypothetical protein Rv2604c - Mycob...  176  2e-43
gi|13092704|emb|CAC29982.1| conserved hypothetical protein...  176  2e-43
gi|18977900|ref|NP_579257.1| imidazoleglycerol-phosphate sy...  174  7e-43
gi|6459120|gb|AAF10937.1|AE001982_11 amidotransferase HisH,...  172  2e-42
gi|23017678|gb|ZP_00057402.1| hypothetical protein [Thermob...  172  3e-42
gi|7447524|pir||D71007 hypothetical protein PH1354 - Pyroco...  162  2e-39
gi|9757756|dbj|BAB08237.1| amidotransferase hisH-like prote...  162  2e-39
gi|14521001|ref|NP_126476.1| imidazoleglycerol-phosphate sy...  160  6e-39
gi|21228535|ref|NP_634457.1| Imidazoleglycerol-phosphate sy...  160  7e-39
gi|23052100|gb|ZP_00078782.1| hypothetical protein [Methano...  160  8e-39
gi|25314121|pir||H95170 conserved hypothetical protein SP14...  157  8e-38
gi|19915437|gb|AAM04979.1| pyridoxine biosynthesis protein ...  156  1e-37
gi|15458965|gb|AAL00125.1| Conserved hypothetical protein [...  156  2e-37
gi|18424366|ref|NP_568922.1| imidazoleglycerol-phosphate sy...  155  2e-37
gi|23465711|ref|NP_696314.1| conserved hypothetical protein...  152  3e-36
gi|23335751|gb|ZP_00120984.1| hypothetical protein [Bifidob...  151  5e-36
gi|13540888|ref|NP_110576.1| Predicted glutamine amidotrans...  149  2e-35
gi|21323555|dbj|BAB98182.1| Predicted glutamine amidotransf...  145  4e-34
gi|2650108|gb|AAB90721.1| imidazoleglycerol-phosphate synth...  144  8e-34
gi|20515816|gb|AAM24079.1| predicted glutamine amidotransfe...  143  2e-33
gi|16081194|ref|NP_393487.1| hypothetical protein [Thermopl...  142  2e-33
gi|2128068|pir||C64507 hypothetical protein homolog MJ1661...  142  3e-33
gi|5103636|dbj|BAA79157.1| 186aa long hypothetical protein...  139  2e-32
gi|25314111|pir||C84409 imidazoleglycerol-phosphate synthas...  139  3e-32
gi|15023464|gb|AAK78573.1|AE007574_11 Glutamine amidotransp...  139  3e-32
gi|20094498|ref|NP_614345.1| Predicted glutamine amidotrans...  138  3e-32
gi|7447523|pir||F69120 conserved hypothetical protein MTH19...  132  3e-30
gi|1574496|gb|AAC23295.1| conserved hypothetical protein [H...  131  4e-30
gi|19114059|ref|NP_593147.1| conserved hypothetical protein...  127  7e-29
gi|12802368|gb|AAK07851.1|AF309689_13 Sno-type pyridoxine v...  124  7e-28
gi|9954418|gb|AAG09049.1|AF294268_1 pyridoxine synthesis pr...  124  9e-28
gi|18076241|emb|CAC81978.1| SNO protein [Suberites domuncula]  124  1e-27
gi|18076243|emb|CAC81976.1| SNO protein [Suberites domuncula]  122  2e-27
gi|25314108|pir||H90203 glutamine amidotransferase, probabl...  120  9e-27
gi|22970420|gb|ZP_00017506.1| hypothetical protein [Chlorof...  120  1e-26
gi|22405886|gb|ZP_00000749.1| hypothetical protein [Ferropl...  117  8e-26
gi|13937025|gb|AAK50016.1|AF363613_1 pyridoxine [Aspergillu...  114  6e-25
gi|18313609|ref|NP_560276.1| conserved hypothetical protein...  112  3e-24
gi|15921733|ref|NP_377402.1| 200aa long conserved hypotheti...  111  6e-24
gi|1302459|emb|CAA96268.1| ORF YNL334C [Saccharomyces cerev...  103  1e-21
gi|1084718|pir||S56195 probable membrane protein YFL060C - ...  103  2e-21
gi|1078512|pir||S55081 hypothetical protein YMR095C - yeast...  102  4e-21
gi|23112427|gb|ZP_00097910.1| hypothetical protein [Desulfi...  101  7e-21
gi|23489463|gb|EAA21585.1| Glutamine amidotranspherase [Pla...  85  6e-16
gi|21228132|ref|NP_634054.1| Amidotransferase hisH [Methano...  51  8e-06
gi|20089791|ref|NP_615866.1| imidazoleglycerol-phosphate sy...  50  1e-05
gi|23023420|gb|ZP_00062656.1| hypothetical protein [Leucono...  50  2e-05
gi|23025661|gb|ZP_00064612.1| hypothetical protein [Leucono...  50  2e-05
gi|22999839|gb|ZP_00043799.1| hypothetical protein [Magneto...  49  4e-05
gi|12229838|sp|Q9P4P9|HIS5_EMENI Imidazole glycerol phospha...  49  6e-05
gi|23129284|gb|ZP_00111116.1| hypothetical protein [Nostoc ...  48  8e-05
gi|25287041|pir||AE1977 amidotransferase [imported] - Nosto...  48  9e-05
gi|22776228|dbj|BAC12505.1| amidotransferase [Oceanobacillu...  47  1e-04
gi|22776424|dbj|BAC12700.1| phosphoribosylformylglycinamidi...  47  2e-04
gi|14916682|sp|Q9UXW5|PURQ_PYRAB Phosphoribosylformylglycin...  46  2e-04
gi|11258455|pir||H81274 amidotransferase Cj1315c [imported]...  46  2e-04
gi|23109765|gb|ZP_00095937.1| hypothetical protein [Novosph...  46  3e-04
gi|12229843|sp|Q9RDX3|HIS5_LEGPN Imidazole glycerol phospha...  45  4e-04
gi|23112136|gb|ZP_00097663.1| hypothetical protein [Desulfi...  45  6e-04
gi|15559183|gb|AAK58487.1| unknown [Campylobacter jejuni]  45  6e-04



EXAMPLE 13: Characterizing functional B. subtilis yaaD and yaaE homologs
by complementation of E. coli pdx mutants by plasmids that express the B. subtilis
yaaD and/or yaaE homologs.

In order to determine whether any given B. subtilis yaaD and/or yaaE homolog has a
function similar to that of the B. subtilis genes, the method of rescuing E. coli or other pdx auxotrophs
can be employed. Briefly, DNA homologous to yaaD and/or yaaE sequences, or DNA
encoding proteins homologous to YaaD and/or YaaE, identified by BLAST™ or other
methods, can be amplified using the Polymerase Chain Reaction (PCR) with Pfu Turbo
DNA polymerase (Stratagene Inc., used according to the manufacturer's instructions)
and appropriately designed DNA primers. In a method analogous to the
overexpression of the yaaDE operon in B. subtilis described in Example 4, a DNA
fragment that contains a homologous gene or operon (Type B pathway) and an artificial
ribosome binding site can be inserted into an appropriate expression vector, for example
either of two expression vectors from Example 4. In these plasmids the yaaDE
homologs would be expressed from a strong constitutive B. subtilis phage SP01
promoter such as P15 or P26. The resulting plasmids can be used to transform various E.
coli or B. subtilis strains that contain loss-of-function mutations in known genes
involved in PLP synthesis. The transformants can be selected for resistance to an
appropriate antibiotic (such as ampicillin), with the resistance encoded by the plasmid harboring the
yaaDE homologs. Transformants can be tested for growth on minimal medium (SMM
with 0.5% glucose) supplemented with 100 mg/L serine, and compared to growth of their
respective untransformed parents on the same medium. True transformants (as opposed
to revertants) that grow on minimal medium without any added B6 vitamers can be
concluded to contain a gene or genes that function in the biosynthesis of B6.

EXAMPLE 14: Assay for the detection of B6 vitamers. 

An assay for the detection of B6 vitamers produced in heterologous
systems can be performed by using Saccharomyces uvarum as an indicator strain, as
described in Example 1. An alternative assay is an HPLC assay. Quantitation of B6
vitamers in supernatants of cultures of microorganisms or extracts of organisms that
have been genetically modified to increase the production of B6 vitamers can be
accomplished by filtration (Centricon, 0.45 µm), followed by high pressure liquid
chromatography (HPLC). Quantitation is accomplished by comparing samples to
standard curves generated by running known solutions of PMP, PLP, PNP, PM, PL or
PN. For the HPLC assay, filtered supernatants are diluted 1/5 with 0.45 µm filtered
Buffer A (50 mM KPO4, 5 mM EDTA, 2% acetonitrile, pH 7.0) and 10 µL of this
solution is injected onto a 250 x 4.6 mm Aqua™ 5µ C18 column run on an Agilent™
1100 series HPLC equipped with a G1321A fluorescence detector. The excitation
wavelength is 330 nm and the emission wavelength is 380 nm. An isocratic flow is
maintained on the C18 column for 15 minutes with Buffer A at 1 mL/min. This is
followed by a 2 minute wash with 2% acetonitrile and a 15 minute wash with 95%
acetonitrile (both in water without salts). An equilibration step consisting of a 2% ACN wash for 5 min
followed by a 5 minute Buffer A wash is performed in preparation for the next sample
injection. Typical retention times for PMP, PM, PLP, PL and PN are 2.7, 4.6, 5.3, 7.8
and 9.6 minutes, respectively. ChemStation™, the accompanying software package
provided by Agilent, is utilized for instrument control, data acquisition and data
evaluation, and is run on an HP Pentium™ 4 computer that supports Microsoft™ Windows
NT 4.0 updated with a Microsoft™ Service Pack (SP6a).
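The quantitation described in this example can be sketched in a few lines of Python. This is an illustrative sketch under stated assumptions, not the ChemStation™ procedure: peaks are assigned to the vitamer with the nearest typical retention time quoted above, a straight-line standard curve is fitted by least squares, and the result is multiplied by 5 to correct for the 1/5 dilution. The 0.3 minute matching tolerance, the standard concentrations and the peak areas are hypothetical values chosen for illustration.

```python
# Typical retention times (minutes) quoted in the text.
RETENTION_MIN = {"PMP": 2.7, "PM": 4.6, "PLP": 5.3, "PL": 7.8, "PN": 9.6}
DILUTION_FACTOR = 5  # supernatants are diluted 1/5 with Buffer A before injection


def assign_vitamer(rt_min, tolerance=0.3):
    """Return the vitamer with the closest typical retention time, or None.

    The 0.3 min tolerance is an assumed value, not taken from the text.
    """
    name, expected = min(RETENTION_MIN.items(), key=lambda kv: abs(kv[1] - rt_min))
    return name if abs(expected - rt_min) <= tolerance else None


def fit_standard_curve(concs_mg_per_l, peak_areas):
    """Least-squares fit of area = slope * concentration + intercept."""
    n = len(concs_mg_per_l)
    mean_x = sum(concs_mg_per_l) / n
    mean_y = sum(peak_areas) / n
    sxx = sum((x - mean_x) ** 2 for x in concs_mg_per_l)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(concs_mg_per_l, peak_areas))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def titer_mg_per_l(peak_area, slope, intercept):
    """Concentration in the undiluted supernatant, corrected for the 1/5 dilution."""
    return (peak_area - intercept) / slope * DILUTION_FACTOR


# Hypothetical PN standards (mg/L) and detector responses, for illustration only.
slope, intercept = fit_standard_curve([0.5, 1.0, 2.0, 4.0], [50.0, 100.0, 200.0, 400.0])
sample_peak = {"rt_min": 9.5, "area": 150.0}
vitamer = assign_vitamer(sample_peak["rt_min"])
titer = titer_mg_per_l(sample_peak["area"], slope, intercept)
print(vitamer, titer)
```

In practice each vitamer needs its own standard curve, since fluorescence response differs between compounds.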



EXAMPLE 15: Genes homologous to the type A B6 biosynthetic pathway
expressed in host organisms for the production of B6.

Preferred methods for the production of vitamin B6 feature the expression
of a gene of the type A pathway, e.g., epd, pdxA, pdxJ, pdxF, pdxB, pdxH or dxs
homologs, or a combination thereof, in host organisms (type A or type B organisms)
such as E. coli, S. cerevisiae, Arabidopsis thaliana, etc. In order to identify genes or
proteins that are homologous to those of the type A pathway, a procedure such as a protein or
nucleotide BLAST™ search can be implemented as described in Example 12 (Altschul,
Stephen F., et al. (1997) Nucleic Acids Res. 25:3389-3402). Function can further be confirmed
as described in Example 13.



Equivalents

Those skilled in the art will recognize, or be able to ascertain using no
more than routine experimentation, many equivalents to the specific embodiments of the
invention described herein. Such equivalents are intended to be encompassed by the
following claims.



What is claimed:

1. A method for producing a B6 vitamer comprising culturing an
organism with an increased YaaD and/or YaaE activity as compared to the parent
organism.

2. The method of claim 1, wherein increased YaaD and/or YaaE
activity is due to increased expression of a nucleic acid molecule encoding a YaaD
polypeptide and/or a YaaE polypeptide as compared to an unmodified parent organism.

3. The method of claim 2, wherein increased yaaD and/or yaaE
nucleic acid molecule expression is due to the deregulation or introduction of nucleic
acid molecules encoding a YaaD polypeptide and/or a YaaE polypeptide into the
organism.

4. The method of claim 3, wherein said yaaD nucleic acid molecule
encodes a polypeptide comprising the amino acid sequence of SEQ ID NO:21 and/or
wherein said yaaE nucleic acid encodes a polypeptide comprising the amino acid
sequence SEQ ID NO:23.

5. The method of claim 3, wherein said yaaD nucleic acid encodes a
polypeptide comprising an amino acid sequence which is at least 30% identical to the
amino acid sequence of SEQ ID NO:21, said polypeptide having a YaaD activity.

6. The method of claim 3, wherein said yaaE nucleic acid encodes a
polypeptide comprising an amino acid sequence which is at least 30% identical to the
amino acid sequence of SEQ ID NO:23, said polypeptide having a YaaE activity.

7. The method of claim 3, wherein a nucleic acid molecule
comprising the nucleotide sequence of SEQ ID NO:20 and/or SEQ ID NO:22 is
introduced.

8. The method of claim 1, wherein the organism is selected from the
group consisting of plants and microorganisms.

9. The method of claim 8, wherein the microorganism is selected
from the group consisting of algae, fungi, yeast, and bacteria.
10. A method for producing a B6 vitamer comprising culturing an
organism with an increased Epd, PdxA, PdxJ, PdxF, PdxB, PdxH, and/or a Dxs activity
as compared to the parent organism.

11. The method of claim 10, wherein increased Epd, PdxA, PdxJ,
PdxF, PdxB, PdxH, and/or a Dxs activity is due to increased expression of a nucleic acid
molecule encoding an Epd, PdxA, PdxJ, PdxF, PdxB, PdxH, and/or Dxs polypeptide as
compared to an unmodified parent organism.

12. The method of claim 11, wherein increased epd, pdxA, pdxJ,
pdxF, pdxB, pdxH, and/or dxs nucleic acid molecule expression is due to the
introduction of nucleic acid molecules encoding an Epd, PdxA, PdxJ, PdxF, PdxB,
PdxH, and/or Dxs polypeptide into the organism.

13. The method of claim 12, wherein said pdxA nucleic acid encodes
a polypeptide comprising the amino acid sequence of SEQ ID NO:25 and/or wherein
said pdxJ nucleic acid encodes a polypeptide comprising the amino acid sequence SEQ
ID NO:27.

14. The method of claim 12, wherein said pdxA nucleic acid encodes
a polypeptide comprising an amino acid sequence which is at least 30% identical to the
amino acid sequence of SEQ ID NO:25, said polypeptide having a PdxA activity.

15. The method of claim 12, wherein said pdxJ nucleic acid encodes a
polypeptide comprising an amino acid sequence which is at least 30% identical to the
amino acid sequence of SEQ ID NO:27, said polypeptide having a PdxJ activity.

16. The method of claim 12, wherein a nucleic acid molecule
comprising the nucleotide sequence of SEQ ID NO:24 and/or SEQ ID NO:26 is
introduced.

17. The method of claim 10, wherein the organism is selected from
the group consisting of plants and microorganisms.

18. The method of claim 17, wherein the microorganism is selected
from the group consisting of algae, fungi, yeast, and bacteria.
19. An organism that has been genetically modified to comprise a
recombinant DNA molecule that results in the increase of the activity of one or more
enzymes that catalyze(s) a step in the biosynthesis of a B6 vitamer, such that B6 vitamer
production from said modified organism is increased compared to B6 production in an
unmodified parent organism.

20. The organism of claim 19, wherein B6 vitamer production is at
least ten-fold higher than from the unmodified parent organism.

21. The organism of claim 19, wherein said enzyme is one or more of
YaaD or YaaE, or a homologue thereof; wherein said homologue has the ability to
rescue an auxotroph in a test system.

22. The organism of claim 21, wherein said homologue is selected
from the homologues listed in Table 9 or Table 10.

23. The organism of claim 19, wherein said organism is genetically
modified to overexpress one or more genes that encodes an enzyme that catalyzes a step
in the biosynthesis of a B6 vitamer.

24. The organism of claim 23, wherein said genes are derived from
Bacillus.

25. The organism of claim 23, wherein said genes are derived from
Bacillus subtilis.

26. The organism of claim 23, wherein at least one of said genes is a
yaaD gene.

27. The organism of claim 23, wherein at least one of said genes is a
yaaE gene.

28. The organism of claim 23, wherein at least two of said genes are
yaaD and yaaE genes.
29. The organism of claim 23, wherein said organism is a Bacillus
strain.

30. The organism of claim 23, wherein said organism is Bacillus
subtilis.

31. The organism of claim 23, wherein said organism is Escherichia
coli.

32. The organism of claim 19, wherein said genes are selected from
the group consisting of E. coli epd, pdxA, pdxJ, pdxF, pdxB, pdxH or dxs.

33. The organism of any one of claims 19-30, wherein said organism
is grown in a liquid culture and the total B6 vitamer concentration in the culture
supernatant is at least 7.0 mg/liter.

34. A method of producing a B6 vitamer comprising culturing a
microorganism that has been genetically modified to overexpress one or more genes that
encodes an enzyme that catalyzes a step in the biosynthesis of a B6 vitamer, such that
B6 vitamer production from said modified organism is increased compared to B6
production in an unmodified parent organism, under conditions such that the B6 vitamer
is produced.

35. The method of claim 34, wherein said enzyme is one or more of
YaaD or YaaE.

36. The method of claim 34, wherein at least one of said genes is a
yaaD gene.

37. The method of claim 34, wherein at least one of said genes is a
yaaE gene.

38. The method of claim 34, wherein said genes are contained on the
yaaDE operon.

39. The method of claim 34, wherein the B6 vitamer is pyridoxine.
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40. The method of claim 34, wherein the B6 vitamer is pyridoxal. 

41 . The method of claim 34, wherein the B6 vitamer is pyridoxamine. 

5 42. The method of claim 34, wherein said genes are bacterial- 

derived. 



43. The method of claim 34, wherein said genes are derived from
Bacillus.

44. The method of claim 34, wherein said genes are derived from
Bacillus subtilis.

45. The method of claim 34, wherein the microorganism is Gram
positive.

46. The method of claim 34, wherein the microorganism is a
microorganism belonging to a genus selected from the group consisting of Bacillus,
Corynebacterium, Lactobacillus, Lactococci and Streptomyces.

47. The method of claim 34, wherein the microorganism is of the
genus Bacillus.

48. The method of claim 34, wherein the microorganism is Bacillus
subtilis.



49. The method of claim 34, further comprising recovering the B6
vitamer.

50. A method of producing a B6 vitamer comprising culturing a
microorganism that overexpresses at least one Bacillus B6 vitamer biosynthetic gene
under conditions such that the B6 vitamer is produced.

51. The method of claim 50, wherein the microorganism
overexpresses at least one Bacillus subtilis B6 vitamer biosynthetic enzyme.

52. The method of claim 50, wherein the B6 vitamer is pyridoxine.

53. The method of claim 50, wherein the B6 vitamer is pyridoxal.

54. The method of claim 50, wherein the B6 vitamer is pyridoxamine. 



55. The method of claim 50, wherein the microorganism 
overexpresses at least two B6 vitamer biosynthetic enzymes. 

56. The method of claim 50, wherein the microorganism is Gram
positive.

57. The method of claim 50, wherein the microorganism is Gram
negative.

58. The method of claim 50, wherein the microorganism is a
microorganism belonging to a genus selected from the group consisting of Bacillus,
Corynebacterium, Lactobacillus, Lactococci and Streptomyces.

59. The method of claim 50, wherein the microorganism is of the 
genus Bacillus. 



60. The method of claim 50, wherein the microorganism is Bacillus
subtilis.

61. The method of claim 50, further comprising recovering the B6
vitamer.

62. A recombinant microorganism that overexpresses at least one
Bacillus B6 vitamer biosynthetic gene.

63. A recombinant microorganism that overexpresses at least one
Bacillus B6 vitamer biosynthetic enzyme.



64. The method of claim 63, wherein said enzyme is YaaD or YaaE. 



65. The recombinant microorganism of claim 62 that overexpresses at
least one Bacillus subtilis B6 vitamer biosynthetic gene.

66. The recombinant microorganism of claim 62, wherein the B6
vitamer biosynthetic gene is selected from the group consisting of yaaD and yaaE.

67. The recombinant microorganism of claim 62, that is Gram
positive.

68. The recombinant microorganism of claim 62, belonging to a
genus selected from the group consisting of Bacillus, Corynebacterium, Lactobacillus,
Lactococci and Streptomyces.

69. The recombinant microorganism of claim 62 belonging to the
genus Bacillus.

70. The recombinant microorganism of claim 62 which is Bacillus
subtilis.

71. A recombinant microorganism selected from the group consisting
of PX14 and PX17.

72. A vector comprising a nucleic acid sequence that encodes at least
one Bacillus B6 vitamer biosynthetic gene operably linked to regulatory sequences.

73. The vector of claim 72, comprising a nucleic acid sequence that
encodes at least one Bacillus subtilis B6 vitamer biosynthetic gene.

74. The vector of claim 72, wherein the regulatory sequences
comprise a constitutively active promoter.

75. The vector of claim 72, wherein the constitutively active promoter
comprises P15 (SEQ ID NO:9) or P26 (SEQ ID NO:10) sequences.

76. The vector of claim 72, wherein the regulatory sequences 
comprise at least one artificial ribosome binding site (RBS). 

77. A vector selected from the group consisting of pDX14R and
pDX17R.

78. A recombinant microorganism comprising the vector of claim 72.

79. An isolated nucleic acid molecule that encodes at least one
Bacillus B6 vitamer biosynthetic gene.

80. The isolated nucleic acid molecule of claim 79 that encodes at
least one Bacillus subtilis B6 vitamer biosynthetic gene.

81. An isolated Bacillus B6 vitamer biosynthetic enzyme polypeptide.

82. An isolated Bacillus subtilis B6 vitamer biosynthetic enzyme
polypeptide.

Figure 4. Structure of pDXIF



WO 2004/035010

PCT/US2003/008880



SEQUENCE LISTING 



<110> Omnigene Bioproducts, Inc., et al. 

<120> METHODS AND ORGANISMS FOR PRODUCTION OF B6 VITAMERS
<130> BGI-152PC 

<150> 60/367089 
<151> 2002-03-22 

<150> 60/367863 
<151> 2002-03-25 

<150> 60/368618 
<151> 2002-03-29 



<160> 27 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 46
<212> DNA

<213> Artificial Sequence

<220>
<223> primer 

<400> 1 

ccctctagag gaggagaaaa catggctcaa acaggtactg aacgtg 

<210> 2 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 2 

cccggatcct caactgtttt atacaagtgc cttttgctta tattc 

<210> 3 
<211> 49 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> primer 
<400> 3 

ccctctagag gaggagacat aatggaacgt acaacgaatt ttaacgcag 

<210> 4 
<211> 41 
<212> DNA 

<213> Artificial Sequence
<220>

<223> primer
<400> 4

cccggatccc ggacggtttg catagccaga ctttttactc a 41 
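
The four primer entries above can be sanity-checked mechanically. The following is a minimal sketch, not part of the original listing: it compares each primer's residue count with the length declared in its <211> field and reports where the hexamers tctaga and ggatcc occur. Those hexamers are the recognition sequences of XbaI and BamHI; reading them as deliberately engineered cloning sites is an assumption here, since this section of the listing does not say so. The names PRIMERS, DECLARED_LENGTHS, SITES, find_sites and check_primers are illustrative.

```python
# Primer sequences as printed in SEQ ID NO:1-4 (groups of ten, spaces kept).
PRIMERS = {
    1: "ccctctagag gaggagaaaa catggctcaa acaggtactg aacgtg",
    2: "cccggatcct caactgtttt atacaagtgc cttttgctta tattc",
    3: "ccctctagag gaggagacat aatggaacgt acaacgaatt ttaacgcag",
    4: "cccggatccc ggacggtttg catagccaga ctttttactc a",
}

# Lengths declared in the <211> fields of the listing.
DECLARED_LENGTHS = {1: 46, 2: 45, 3: 49, 4: 41}

# Recognition sequences. Linking them to these primers is an assumption.
SITES = {"XbaI": "tctaga", "BamHI": "ggatcc"}


def clean(seq):
    """Remove the layout spaces and normalise case."""
    return seq.replace(" ", "").lower()


def find_sites(seq):
    """Return {enzyme: 0-based index} for each recognition site present."""
    s = clean(seq)
    return {name: s.find(site) for name, site in SITES.items() if site in s}


def check_primers():
    """Compare each primer with its declared length and list its sites."""
    report = {}
    for seq_id, seq in PRIMERS.items():
        report[seq_id] = {
            "length_ok": len(clean(seq)) == DECLARED_LENGTHS[seq_id],
            "sites": find_sites(seq),
        }
    return report


if __name__ == "__main__":
    for seq_id, info in check_primers().items():
        print(seq_id, info)
```

Under these assumptions, all four declared lengths agree with the printed residues, and each primer carries one of the two hexamers immediately after its three leading c residues.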

<210> 5 
<211> 5322 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid - pDXIF 
<400> 5 

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctcc cggccgccat 240
ggccgcggga tccctctaga ggaggagaaa acatggctca aacaggtact gaacgtgtaa 300 
aacgcggaat ggcagaaatg caaaaaggcg gcgtcatcat ggacgtcatc aatgcggaac 360 
aagcgaaaat cgctgaagaa gctggagctg tcgctgtaat ggcgctagaa cgtgtgccag 420 
cagatattcg cgcggctgga ggagttgccc gtatggctga ccctacaatc gtggaagaag 480 
taatgaatgc agtatctatc ccggtaatgg caaaagcgcg tatcggacat attgttgaag 540 
cgcgtgtgct tgaagctatg ggtgttgact atattgatga aagtgaagtt ctgacgccgg 600 
ctgacgaaga atttcattta aataaaaatg aatacacagt tccttttgtc tgtggctgcc 660 
gtgatcttgg tgaagcaaca cgccgtattg cggaaggtgc ttctatgctt cgcacaaaag 720 
gtgagcctgg aacaggtaat attgttgagg ctgttcgcca tatgcgtaaa gttatcacta 780 
gtgcggccgc ctgcaggtcg accatatggg agaggcggcc gcgtcgacca atagttaccc 840 
ttattatcaa gataagaaag aaaaggattt ttcgctacgc tcaaatcctt taaaaaaaca 900 
caaaagacca cattttttaa tgtggtcttt attcttcaac taaagcaccc attagttcaa 960 
caaacgaaaa ttggataaag tgggatattt ttaaaatata tatttatgtt acagtaatat 1020 
tgacttttaa aaaaggattg attctaatga agaaagcaga caagtaagcc tcctaaattc 1080 
actttagata aaaatttagg aggcatatca aatgaacttt aataaaattg atttagacaa 1140 
ttggaagaga aaagagatat ttaatcatta tttgaaccaa caaacgactt ttagtataac 1200
cacagaaatt gatattagtg ttttataccg aaacataaaa caagaaggat ataaatttta 1260 
ccctgcattt attttcttag tgacaagggt gataaactca aatacagctt ttagaactgg 1320
ttacaatagc gacggagagt taggttattg ggataagtta gagccacttt atacaatttt 1380 
tgatggtgta tctaaaacat tctctggtat ttggactcct gtaaagaatg acttcaaaga 1440 
gttttatgat ttataccttt ctgatgtaga gaaatataat ggttcgggga aattgtttcc 1500 
caaaacacct atacctgaaa atgctttttc tctttctatt attccatgga cttcatttac 1560 
tgggtttaac ttaaatatca ataataatag taattacctt ctacccatta ttacagcagg 1620 
aaaattcatt aataaaggta attcaatata tttaccgcta tctttacagg tacatcattc 1680 
tgtttgtgat ggttatcatg caggattgtt tatgaactct attcaggaat tgtcagatag 1740 
gcctaatgac tggcttttat aatatgagat aatgccgact gtacttttta cagtcggttt 1800 
tctaatgtca ctaacctgcc ccgttagttg aagaaggttt ttatattaca gctccggtac 1860 
cgggctccca acgcgttgga tgcatagctt gagtattcta tagtgtcacc taaatagctt 1920 
ggcgtaatca tggtcatagc tgtttcctgt gtgaaattgt tatccgaaca ataggtgtac 1980 
taggacttca aggagcagtt agagagcaca tccatgcgat tgaagcatgc ggcgcggctg 2040
gtcttgtcgt aaaacgtccg gagcagctga acgaagttga cgggttgatt ttgccgggcg 2100 
gtgagagcac gacgatgcgc cgtttgatcg atacgtatca attcatggag ccgcttcgtg 2160 
aattcgctgc tcagggcaaa ccgatgtttg gaacatgtgc cggattaatt atattagcaa 2220 
aagaaattgc cggttcagat aatcctcatt taggtcttct gaatgtggtt gtagaacgta 2280 
attcatttgg ccggcaggtt gacagctttg aagctgattt aacaattaaa ggcttggacg 2340 
agccttttac tggggtattc atccgtgctc cgcatatttt agaagctggt gaaaatgttg 2400 
aagttctatc ggagcataat ggtcgtattg tagccgcgaa acaggggcaa ttccttggct 2460 
gctcattcca tccggagctg acagaagatc accgagtgac gcagctgttt gttgaaatgg 2520 
ttgaggaata taagcaaaag gcacttgtat aaaacagttg aggatccggg atcactagtg 2580
cggccgcctg caggtcgacc atatgggaga gctcccaacg cgttggatgc atagcttgag 2640
tattctatag tgtcacctaa atagcttggc gtaatcatgg tcatagctgt ttcctgtgtg 2700 
aaattgttat ccgctcacaa ttccacacaa catacgagcc ggaagcataa agtgtaaagc 2760 
ctggggtgcc taatgagtga gctaactcac attaattgcg ttgcgctcac tgcccgcttt 2820 
ccagtcggga aacctgtcgt gccagctgca ttaatgaatc ggccaacgcg cggggagagg 2880 
cggtttgcgt attgggcgct cttccgcttc ctcgctcact gactcgctgc gctcggtcgt 2940 
tcggctgcgg cgagcggtat cagctcactc aaaggcggta atacggttat ccacagaatc 3000 
aggggataac gcaggaaaga acatgtgagc aaaaggccag caaaaggcca ggaaccgtaa 3060 
aaaggccgcg ttgctggcgt ttttcgatag gctccgcccc cctgacgagc atcacaaaaa 3120 
tcgacgctca agtcagaggt ggcgaaaccc gacaggacta taaagatacc aggcgtttcc 3180 
ccctggaagc tccctcgtgc gctctcctgt tccgaccctg ccgcttaccg gatacctgtc 3240 
cgcctttctc ccttcgggaa gcgtggcgct ttctcatagc tcacgctgta ggtatctcag 3300 
ttcggtgtag gtcgttcgct ccaagctggg ctgtgtgcac gaaccccccg ttcagcccga 3360 
ccgctgcgcc ttatccggta actatcgtct tgagtccaac ccggtaagac acgacttatc 3420 
gccactggca gcagccactg gtaacaggat tagcagagcg aggtatgtag gcggtgctac 3480 
agagttcttg aagtggtggc ctaactacgg ctacactaga aggacagtat ttggtatctg 3540 
cgctctgctg aagccagtta ccttcggaaa aagagttggt agctcttgat ccggcaaaca 3600 
aaccaccgct ggtagcggtg gtttttttgt ttgcaagcag cagattacgc gcagaaaaaa 3660 
aggatctcaa gaagatcctt tgatcttttc tacggggtct gacgctcagt ggaacgaaaa 3720 
ctcacgttaa gggattttgg tcatgagatt atcaaaaagg atcttcacct agatcctttt 3780 
aaattaaaaa tgaagtttta aatcaatcta aagtatatat gagtaaactt ggtctgacag 3840 
ttaccaatgc ttaatcagtg aggcacctat ctcagcgatc tgtctatttc gttcatccat 3900 
agttgcctga ctccccgtcg tgtagataac tacgatacgg gagggcttac catctggccc 3960 
cagtgctgca atgataccgc gagacccacg ctcaccggct ccagatttat cagcaataaa 4020 
ccagccagcc ggaagggccg agcgcagaag tggtcctgca actttatccg cctccatcca 4080 
gtctattaat tgttgccggg aagctagagt aagtagttcg ccagttaata gtttgcgcaa 4140 
cgttgttggc attgctacag gcatcgtggt gtcacgctcg tcgtttggta tggcttcatt 4200
cagctccggt tcccaacgat caaggcgagt tacatgatcc cccatgttgt gcaaaaaagc 4260 
ggttagctcc ttcggtcctc cgatcgttgt cagaagtaag ttggccgcag tgttatcact 4320 
catggttatg gcagcactgc ataattctct tactgtcatg ccatccgtaa gatgcttttc 4380 
tgtgactggt gagtactcaa ccaagtcatt ctgagaatac cgcgcccggc gaccgagttg 4440
ctcttgcccg gcgtcaatac gggataatag tgtatgacat agcagaactt taaaagtgct 4500 
catcattgga aaacgttctt cggggcgaaa actctcaagg atcttaccgc tgttgagatc 4560 
cagttcgatg taacccactc gtgcacccaa ctgatcttca gcatctttta ctttcaccag 4620 
cgtttctggg tgagcaaaaa caggaaggca aaatgccgca aaaaagggaa taagggcgac 4680 
acggaaatgt tgaatactca tactcttcct ttttcaatat tattgaagca tttatcaggg 4740 
ttattgtctc atgagcggat acatatttga atgtatttag aaaaataaac aaataggggt 4800 
tccgcgcaca tttccccgaa aagtgccacc tgtatgcggt gtgaaatacc gcacagatgc 4860 
gtaaggagaa aataccgcat caggcgaaat tgtaaacgtt aatattttgt taaaattcgc 4920 
gttaaatatt tgttaaatca gctcattttt taaccaatag gccgaaatcg gcaaaatccc 4980 
ttataaatca aaagaataga ccgagatagg gttgagtgtt gttccagttt ggaacaagag 5040 
tccactatta aagaacgtgg actccaacgt caaagggcga aaaaccgtct atcagggcga 5100 
tggcccacta cgtgaaccat cacccaaatc aagttttttg cggtcgaggt gccgtaaagc 5160 
tctaaatcgg aaccctaaag ggagcccccg atttagagct tgacggggaa agccggcgaa 5220
cgtggcgaga aaggaaggga agaaagcgaa aggagcgggc gctagggcgc tggcaagtgt 5280 
agcggtcacg ctgcgcgtaa ccaccacacc cgccgcgctt aa 5322 
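
Each sequence line in this listing holds groups of ten residues followed by a cumulative residue count, so the layout itself can be used to detect transcription damage. The following is a minimal sketch under that assumption, not part of the original listing. The function name parse_listing and its start parameter are illustrative. It joins the residue groups, rejects any group containing characters other than a, c, g and t, and checks that each trailing counter equals the running total.

```python
import re


def parse_listing(lines, start=0):
    """Concatenate residues from listing lines and check each running counter.

    Each line is expected to hold space-separated residue groups, optionally
    followed by the cumulative residue count. `start` is the count reached
    before the first line passed in.
    """
    sequence = []
    total = start
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        counter = None
        if tokens[-1].isdigit():
            counter = int(tokens.pop())
        for group in tokens:
            # Anything other than a, c, g, t indicates damaged text.
            if not re.fullmatch(r"[acgt]+", group):
                raise ValueError(f"unexpected characters in {group!r}")
            sequence.append(group)
            total += len(group)
        if counter is not None and counter != total:
            raise ValueError(
                f"counter {counter} does not match running total {total}"
            )
    return "".join(sequence)


if __name__ == "__main__":
    # First two lines of SEQ ID NO:5 as they should read.
    sample = [
        "tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60",
        "tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120",
    ]
    print(len(parse_listing(sample)))
```

A counter that has been split by a stray space, or a residue group containing a letter outside a, c, g and t, raises ValueError instead of passing silently, which is the point of validating rather than filtering.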



<210> 6 
<211> 5297 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> plasmid - pDXHF
<400> 6

tgcgccgcta cagggcgcgt ccattcgcca ttcaggctgc gcaactgttg ggaagggcga 60 
tcggtgcggg cctcttcgct attacgccag ctggcgaaag ggggatgtgc tgcaaggcga 120 
ttaagttggg taacgccagg gttttcccag tcacgacgtt gtaaaacgac ggccagtgaa 180 
ttgtaatacg actcactata gggcgaattg ggcccgacgt cgcatgctcc cggccgccat 240 
ggccgcggga tccctctaga ggaggagaca taatggaacg tacaacgaat tttaacgcag 300 
gtcctgcagc gctgccactg gaagttctgc aaaaagcaca gaaagaattt attgatttta 360 
acgaatccgg catgtctgtt atggagcttt cccaccgcag caaagagtat gaagcggtgc 420 
accaaaaagc gaaaagcctc ttaatcgaac tgatgggcat tccggaagat tacgatatct 480 
tgtttcttca aggcggggca agccttcaat tctcaatgct tccgatgaac tttttaacac 540 
ctgaaaaaac cgcacatttt gtgatgaccg gcgcttggtc tgaaaaagca ctggcagaaa 600 
cgaaactgtt cgggaacacg tctatcaccg ctacaagtga aacagacaat tacagttata 660 
ttccagaggt tgaccttacg gatgtaaaag acggcgcata tttacatatc acatccaaca 720 
atacaatttt cggcactcag tggcaggagt ttccgaattc tccaattccg ctcgtagccg 780 
acatgtccag cgatatttta agcagaaaaa tcgatgtgtc caaatttgat gtgatctacg 840 
gaggcatcac tagtgcggcc gcctgcaggt cgaccatatg ggagaggcgg ccgcgtcgac 900 
caatagttac ccttattatc aagataagaa agaaaaggat ttttcgctac gctcaaatcc 960 
tttaaaaaaa cacaaaagac cacatttttt aatgtggtct ttattcttca actaaagcac 1020 
ccattagttc aacaaacgaa aattggataa agtgggatat ttttaaaata tatatttatg 1080 
ttacagtaat attgactttt aaaaaaggat tgattctaat gaagaaagca gacaagtaag 1140 
cctcctaaat tcactttaga taaaaattta ggaggcatat caaatgaact ttaataaaat 1200 
tgatttagac aattggaaga gaaaagagat atttaatcat tatttgaacc aacaaacgac 1260 
ttttagtata accacagaaa ttgatattag tgttttatac cgaaacataa aacaagaagg 1320 
atataaattt taccctgcat ttattttctt agtgacaagg gtgataaact caaatacagc 1380 
ttttagaact ggttacaata gcgacggaga gttaggttat tgggataagt tagagccact 1440 
ttatacaatt tttgatggtg tatctaaaac attctctggt atttggactc ctgtaaagaa 1500 
tgacttcaaa gagttttatg atttatacct ttctgatgta gagaaatata atggttcggg 1560 
gaaattgttt cccaaaacac ctatacctga aaatgctttt tctctttcta ttattccatg 1620 
gacttcattt actgggttta acttaaatat caataataat agtaattacc ttctacccat 1680 
tattacagca ggaaaattca ttaataaagg taattcaata tatttaccgc tatctttaca 1740 
ggtacatcat tctgtttgtg atggttatca tgcaggattg tttatgaact ctattcagga 1800 
attgtcagat aggcctaatg actggctttt ataatatgag ataatgccga ctgtactttt 1860 
tacagtcggt tttctaatgt cactaacctg ccccgttagt tgaagaaggt ttttatatta 1920 
cagctccggt accgggctcc caacgcgttg gatgcatagc ttgagtattc tatagtgtca 1980 
cctaaatagc ttggcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgta 2040 
gtcatcatga aaaaaagctg gctgcaaaat gaaaatgcga acgtcccaaa aatcttgaaa 2100 
tattccacgc atgtcaaagc ggattcactc tacaacactc cgccgacatt tgcgatttat 2160 
atgctgagcc tcgttctgga atggctcaag gaaaacggcg gtgtggaagc tgttgaacag 2220 
cgcaatgaac aaaaagcgca ggttctctac agctgtattg atgaaagcaa cggcttctat 2280 
aaaggacatg ccagaaaaga cagccgctca cgcatgaatg tcacattcac gcttcgggat 2340 
gacgaattaa cgaaaacatt cgttcagaaa gcaaaagatg cgaagatgat cggccttggc 2400 
ggacaccgtt cggtgggagg ctgccgcgct tctatttata acgcggtctc tctcgaagac 2460 
tgtgaaaaat tagctgcgtt catgaagaaa ttccagcagg aaaatgagta aaaagtctgg 2520 
ctatgcattc cgtccgggat ccgggatcac tagtgcggcc gcctgcaggt cgaccatatg 2580 
ggagagctcc caacgcgttg gatgcatagc ttgagtattc tatagtgtca cctaaatagc 2640 
ttggcgtaat catggtcata gctgtttcct gtgtgaaatt gttatccgct cacaattcca 2700 
cacaacatac gagccggaag cataaagtgt aaagcctggg gtgcctaatg agtgagctaa 2760 
ctcacattaa ttgcgttgcg ctcactgccc gctttccagt cgggaaacct gtcgtgccag 2820 
ctgcattaat gaatcggcca acgcgcgggg agaggcggtt tgcgtattgg gcgctcttcc 2880 
gcttcctcgc tcactgactc gctgcgctcg gtcgttcggc tgcggcgagc ggtatcagct 2940 
cactcaaagg cggtaatacg gttatccaca gaatcagggg ataacgcagg aaagaacatg 3000 
tgagcaaaag gccagcaaaa ggccaggaac cgtaaaaagg ccgcgttgct ggcgtttttc 3060
gataggctcc gcccccctga cgagcatcac aaaaatcgac gctcaagtca gaggtggcga 3120 
aacccgacag gactataaag ataccaggcg tttccccctg gaagctccct cgtgcgctct 3180 
cctgttccga ccctgccgct taccggatac ctgtccgcct ttctcccttc gggaagcgtg 3240 
gcgctttctc atagctcacg ctgtaggtat ctcagttcgg tgtaggtcgt tcgctccaag 3300 
ctgggctgtg tgcacgaacc ccccgttcag cccgaccgct gcgccttatc cggtaactat 3360 
cgtcttgagt ccaacccggt aagacacgac ttatcgccac tggcagcagc cactggtaac 3420 
aggattagca gagcgaggta tgtaggcggt gctacagagt tcttgaagtg gtggcctaac 3480
tacggctaca ctagaaggac agtatttggt atctgcgctc tgctgaagcc agttaccttc 3540
ggaaaaagag ttggtagctc ttgatccggc aaacaaacca ccgctggtag cggtggtttt 3600
tttgtttgca agcagcagat tacgcgcaga aaaaaaggat ctcaagaaga tcctttgatc 3660
ttttctacgg ggtctgacgc tcagtggaac gaaaactcac gttaagggat tttggtcatg 3720
agattatcaa aaaggatctt cacctagatc cttttaaatt aaaaatgaag ttttaaatca 3780
atctaaagta tatatgagta aacttggtct gacagttacc aatgcttaat cagtgaggca 3840
cctatctcag cgatctgtct atttcgttca tccatagttg cctgactccc cgtcgtgtag 3900
ataactacga tacgggaggg cttaccatct ggccccagtg ctgcaatgat accgcgagac 3960
ccacgctcac cggctccaga tttatcagca ataaaccagc cagccggaag ggccgagcgc 4020
agaagtggtc ctgcaacttt atccgcctcc atccagtcta ttaattgttg ccgggaagct 4080
agagtaagta gttcgccagt taatagtttg cgcaacgttg ttggcattgc tacaggcatc 4140
gtggtgtcac gctcgtcgtt tggtatggct tcattcagct ccggttccca acgatcaagg 4200
cgagttacat gatcccccat gttgtgcaaa aaagcggtta gctccttcgg tcctccgatc 4260
gttgtcagaa gtaagttggc cgcagtgtta tcactcatgg ttatggcagc actgcataat 4320
tctcttactg tcatgccatc cgtaagatgc ttttctgtga ctggtgagta ctcaaccaag 4380
tcattctgag aataccgcgc ccggcgaccg agttgctctt gcccggcgtc aatacgggat 4440
aatagtgtat gacatagcag aactttaaaa gtgctcatca ttggaaaacg ttcttcgggg 4500
cgaaaactct caaggatctt accgctgttg agatccagtt cgatgtaacc cactcgtgca 4560
cccaactgat cttcagcatc ttttactttc accagcgttt ctgggtgagc aaaaacagga 4620
aggcaaaatg ccgcaaaaaa gggaataagg gcgacacgga aatgttgaat actcatactc 4680
ttcctttttc aatattattg aagcatttat cagggttatt gtctcatgag cggatacata 4740
tttgaatgta tttagaaaaa taaacaaata ggggttccgc gcacatttcc ccgaaaagtg 4800
ccacctgtat gcggtgtgaa ataccgcaca gatgcgtaag gagaaaatac cgcatcaggc 4860
gaaattgtaa acgttaatat tttgttaaaa ttcgcgttaa atatttgtta aatcagctca 4920
ttttttaacc aataggccga aatcggcaaa atcccttata aatcaaaaga atagaccgag 4980
atagggttga gtgttgttcc agtttggaac aagagtccac tattaaagaa cgtggactcc 5040
aacgtcaaag ggcgaaaaac cgtctatcag ggcgatggcc cactacgtga accatcaccc 5100
aaatcaagtt ttttgcggtc gaggtgccgt aaagctctaa atcggaaccc taaagggagc 5160
ccccgattta gagcttgacg gggaaagccg gcgaacgtgg cgagaaagga agggaagaaa 5220
gcgaaaggag cgggcgctag ggcgctggca agtgtagcgg tcacgctgcg cgtaaccacc 5280
acacccgccg cgcttaa 5297



<210> 7 
<211> 6731 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid - pDX14R 
<400> 7 

ttgcggccgc ttcgaaagct gtaatataaa aaccttcttc aactaacggg gcaggttagt 60
gacattagaa aaccgactgt aaaaagtaca gtcggcatta tctcatatta taaaagccag 120
tcattaggcc tatctgacaa ttcctgaata gagttcataa acaatcctgc atgataacca 180
tcacaaacag aatgatgtac ctgtaaagat agcggtaaat atattgaatt acctttatta 240
atgaattttc ctgctgtaat aatgggtaga aggtaattac tattattatt gatatttaag 300
ttaaacccag taaatgaagt ccatggaata atagaaagag aaaaagcatt ttcaggtata 360
ggtgttttgg gaaacaattt ccccgaacca ttatatttct ctacatcaga aaggtataaa 420
tcataaaact ctttgaagtc attctttaca ggagtccaaa taccagagaa tgttttagat 480
acaccatcaa aaattgtata aagtggctct aacttatccc aataacctaa ctctccgtcg 540
ctattgtaac cagttctaaa agctgtattt gagtttatca cccttgtcac taagaaaata 600
aatgcagggt aaaatttata tccttcttgt tttatgtttc ggtataaaac actaatatca 660
atttctgtgg ttatactaaa agtcgtttgt tggttcaaat aatgattaaa tatctctttt 720
ctcttccaat tgtctaaatc aattttatta aagttcattt gatatgcctc ctaaattttt 780
atctaaagtg aatttaggag gcttacttgt ctgctttctt cattagaatc aatccttttt 840
taaaagtcaa tattactgta acataaatat atattttaaa aatatcccac tttatccaat 900
tttcgtttgt tgaactaatg ggtgctttag ttgaagaata aagaccacat taaaaaatgt 960
ggtcttttgt gtttttttaa aggatttgag cgtagcgaaa aatccttttc tttcttatct 1020
tgataataag ggtaactatt gaattcggta ccaagagttt gtagaaacgc aaaaaggcca 1080 
tccgtcagga tggccttctg cttaatttga tgcctggcag tttatggcgg gcgtcctgcc 1140 
cgccaccctc cgggccgttg cttcgcaacg ttcaaatccg ctcccggcgg atttgtccta 1200 
ctcaggagag cgttcaccga caaacaacag ataaaacgaa aggcccagtc tttcgactga 1260 
gcctttcgtt ttatttgatg cctggcagtt ccctactctc gcatggggag accccacact 1320 
accatcggcg ctacggcgtt tcacttctga gttcggcatg gggtcaggtg ggaccaccgc 1380 
gctactgccg ccaggcaaat tctgttttat cagaccgctt ctgcgttctg atttaatctg 1440 
tatcaggctg aaaatcttct ctcatccgcc aaaacaggat cctcaactgt tttatacaag 1500 
tgccttttgc ttatattcct caaccatttc aacaaacagc tgcgtcactc ggtgatcttc 1560 
tgtcagctcc ggatggaatg agcagccaag gaattgcccc tgtttcgcgg ctacaatacg 1620 
accattatgc tccgatagaa cttcaacatt ttcaccagct tctaaaatat gcggagcacg 1680 
gatgaatacc ccagtaaaag gctcgtccaa gcctttaatt gttaaatcag cttcaaagct 1740 
gtcaacctgc cggccaaatg aattacgttc tacaaccaca ttcagaagac ctaaatgagg 1800 
attatctgaa ccggcaattt cttttgctaa tataattaat ccggcacatg ttccaaacat 1860 
cggtttgccc tgagcagcga attcacgaag cggctccatg aattgatacg tatcgatcaa 1920 
acggcgcatc gtcgtgctct caccgcccgg caaaatcaac ccgtcaactt cgttcagctg 1980 
ctccggacgt tttacgacaa gaccagccgc gccgcatgct tcaatcgcat ggatgtgctc 2040 
tctaactgct ccttgaagtc ctagtacacc tattgttaac atgtcagcag cgctcctatg 2100 
ttcttaccag ccgcgttctt gcatacgctg ttctggaagt aagtttgaga tttcaatccc 2160 
tttcattgca gtaccaagct cttttgacaa ctcagcgatt aatttgtaat cagtaaagtg 2220 
agttgttgct tccacaattg ctttcgcaaa tttagcaggg ttgtctgatt taaaaatacc 2280 
agaaccaaca aatactccgt cagcaccaag ctgcatcatg agagcagcat cagctggagt 2340 
tgctacgccg ccagcggcaa agttaacgac aggaagcttg ccgtcttttt taatttgaag 2400 
aagaagctcg taaggagcac ctaggttttt cgcttctgtc attagctcat cctcactcat 2460 
cgcaactact ttgcgcactt gagcgttaac tttacgcata tggcgaacag cctcaacaat 2520 
attacctgtt ccaggctcac cttttgtgcg aagcatagaa gcaccttccg caatacggcg 2580
tgttgcttca ccaagatcac ggcagccaca gacaaaagga actgtgtatt catttttatt 2640
taaatgaaat tcttcgtcag ccggcgtcag aacttcactt tcatcaatat agtcaacacc 2700 
catagcttca agcacacgcg cttcaacaat atgtccgata cgcgcttttg ccattaccgg 2760 
gatagatact gcattcatta cttcttccac gattgtaggg tcagccatac gggcaactcc 2820
tccagccgcg cgaatatctg ctggcacacg ttctagcgcc attacagcga cagctccagc 2880 
ttcttcagcg attttcgctt gttccgcatt gatgacgtcc atgatgacgc cgcctttttg 2940 
catttctgcc attccgcgtt ttacacgttc agtacctgtt tgagccatgt tttctcctcc 3000 
tctagagcgt cctgctgttg ttaagattat tataccacac cttgtagata aagtcaacaa 3060 
ctttttgcaa aatttttcag gaattttagc agaggttgtt ctggatgtag aacaaaacat 3120 
ctttccgctc ttgtgctgtt aggatatctt tcttggaagc taggtaggcc tcgagttatg 3180
gcagttggtt aaaaggaaac aaaaagaccg ttttcacaca aaacggtctt tttcgatttc 3240
tttttacagt cacagccact tttgcaaaaa ccggacagct tcatgcctta taactgctgt 3300
ttcggtcgac ctgcaggcat gcaagcttcg cgaagcggcc gccgacgcga ggctggatgg 3360
ccttccccat tatgattctt ctcgcttccg gcggcatcgg gatgcccgcg ttgcaggcca 3420
tgctgtccag gcaggtagat gacgaccatc agggacagct tcaaggatcg ctcgcggctc 3480
ttaccagcct aacttcgatc actggaccgc tgatcgtcac ggcgatttat gccgcctcgg 3540 
cgagcacatg gaacgggttg gcatggattg taggcgccgc cctatacctt gtctgcctcc 3600 
ccgcgttgcg tcgcggtgca tggagccggg ccacctcgac ctgaatggaa gccggcggca 3660 
cctcgctaac ggattcacca ctccaagaat tggagccaat caattcttgc ggagaactgt 3720
gaatgcgcaa accaaccctt ggcagaacat atccatcgcg tccgccatct ccagcagccg 3780
cacgcggcgc atctcgggca gcgttgggtc ctggccacgg gtgcgcatga tcgtgctcct 3840
gtcgttgagg acccggctag gctggcgggg ttgccttact ggttagcaga atgaatcacc 3900
gatacgcgag cgaacgtgaa gcgactgctg ctgcaaaacg tctgcgacct gagcaacaac 3960
atgaatggtc ttcggtttcc gtgtttcgta aagtctggaa acgcggaagt cagcgccctg 4020
caccattatg ttccggatct gcatcgcagg atgctgctgg ctaccctgtg gaacacctac 4080
atctgtatta acgaagcgct ggcattgacc ctgagtgatt tttctctggt cccgccgcat 4140
ccataccgcc agttgtttac cctcacaacg ttccagtaac cgggcatgtt catcatcagt 4200
aacccgtatc gtgagcatcc tctctcgttt catcggtatc attaccccca tgaacagaaa 4260
ttccccctta cacggaggca tcaagtgacc aaacaggaaa aaaccgccct taacatggcc 4320
cgctttatca gaagccagac attaacgctt ctggagaaac tcaacgagct ggacgcggat 4380
gaacaggcag acatctgtga atcgcttcac gaccacgctg atgagcttta ccgcagctgc 4440
ctcgcgcgtt tcggtgatga cggtgaaaac ctctgacaca tgcagctccc ggagacggtc 4500
acagcttgtc tgtaagcgga tgccgggagc agacaagccc gtcagggcgc gtcagcgggt 4560
gttggcgggt gtcggggcgc agccatgacc cagtcacgta gcgatagcgg agtgtatact 4620 
ggcttaacta tgcggcatca gagcagattg tactgagagt gcaccatatg cggtgtgaaa 4680 
taccgcacag atgcgtaagg agaaaatacc gcatcaggcg ctcttccgct tcctcgctca 4740 
ctgactcgct gcgctcggtc gttcggctgc ggcgagcggt atcagctcac tcaaaggcgg 4800 
taatacggtt atccacagaa tcaggggata acgcaggaaa gaacatgtga gcaaaaggcc 4860 
agcaaaaggc caggaaccgt aaaaaggccg cgttgctggc gtttttccat aggctccgcc 4920 
cccctgacga gcatcacaaa aatcgacgct caagtcagag gtggcgaaac ccgacaggac 4980 
tataaagata ccaggcgttt ccccctggaa gctccctcgt gcgctctcct gttccgaccc 5040 
tgccgcttac cggatacctg tccgcctttc tcccttcggg aagcgtggcg ctttctcata 5100 
gctcacgctg taggtatctc agttcggtgt aggtcgttcg ctccaagctg ggctgtgtgc 5160 
acgaaccccc cgttcagccc gaccgctgcg ccttatccgg taactatcgt cttgagtcca 5220 
acccggtaag acacgactta tcgccactgg cagcagccac tggtaacagg attagcagag 5280 
cgaggtatgt aggcggtgct acagagttct tgaagtggtg gcctaactac ggctacacta 5340 
gaaggacagt atttggtatc tgcgctctgc tgaagccagt taccttcgga aaaagagttg 5400 
gtagctcttg atccggcaaa caaaccaccg ctggtagcgg tggttttttt gtttgcaagc 5460 
agcagattac gcgcagaaaa aaaggatctc aagaagatcc tttgatcttt tctacggggt 5520
ctgacgctca gtggaacgaa aactcacgtt aagggatttt ggtcatgaga ttatcaaaaa 5580 
ggatcttcac ctagatcctt ttaaattaaa aatgaagttt taaatcaatc taaagtatat 5640 
atgagtaaac ttggtctgac agttaccaat gcttaatcag tgaggcacct atctcagcga 5700 
tctgtctatt tcgttcatcc atagttgcct gactccccgt cgtgtagata actacgatac 5760 
gggagggctt accatctggc cccagtgctg caatgatacc gcgagaccca cgctcaccgg 5820 
ctccagattt atcagcaata aaccagccag ccggaagggc cgagcgcaga agtggtcctg 5880 
caactttatc cgcctccatc cagtctatta attgttgccg ggaagctaga gtaagtagtt 5940 
cgccagttaa tagtttgcgc aacgttgttg ccattgctgc aggcatcgtg gtgtcacgct 6000 
cgtcgtttgg tatggcttca ttcagctccg gttcccaacg atcaaggcga gttacatgat 6060 
cccccatgtt gtgcaaaaaa gcggttagct ccttcggtcc tccgatcgtt gtcagaagta 6120 
agttggccgc agtgttatca ctcatggtta tggcagcact gcataattct cttactgtca 6180 
tgccatccgt aagatgcttt tctgtgactg gtgagtactc aaccaagtca ttctgagaat 6240 
agtgtatgcg gcgaccgagt tgctcttgcc cggcgtcaat acgggataat accgcgccac 6300 
atagcagaac tttaaaagtg ctcatcattg gaaaacgttc ttcggggcga aaactctcaa 6360 
ggatcttacc gctgttgaga tccagttcga tgtaacccac tcgtgcaccc aactgatctt 6420 
cagcatcttt tactttcacc agcgtttctg ggtgagcaaa aacaggaagg caaaatgccg 6480
caaaaaaggg aataagggcg acacggaaat gttgaatact catactcttc ctttttcaat 6540 
attattgaag catttatcag ggttattgtc tcatgagcgg atacatattt gaatgtattt 6600 
agaaaaataa acaaataggg gttccgcgca catttccccg aaaagtgcca cctgacgtct 6660 
aagaaaccat tattatcatg acattaacct ataaaaatag gcgtatcacg aggccctttc 6720 
gtcttcaaga a 6731 
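
The relationship between the primers and the plasmid listings can be checked with a reverse-complement search. The following is a minimal sketch, not part of the original listing. It takes the primer of SEQ ID NO:1 without its three leading c residues, reverse-complements it, and looks for the result in the two lines of SEQ ID NO:7 that cover positions 2941 to 3060 as printed above. The names reverse_complement, PRIMER_1 and PDX14R_2941_3060 are illustrative, and the orientation reading in the closing note is an inference from this match, not a statement made in the listing.

```python
# Translation table for complementing lower-case DNA.
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq):
    """Return the reverse complement of a lower-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


# Primer from SEQ ID NO:1, as printed in the listing.
PRIMER_1 = "ccctctagag gaggagaaaa catggctcaa acaggtactg aacgtg"

# Lines of SEQ ID NO:7 (pDX14R) ending at counters 3000 and 3060.
PDX14R_2941_3060 = (
    "catttctgcc attccgcgtt ttacacgttc agtacctgtt tgagccatgt tttctcctcc"
    "tctagagcgt cctgctgttg ttaagattat tataccacac cttgtagata aagtcaacaa"
)


def strip_spaces(seq):
    """Remove the layout spaces between residue groups."""
    return seq.replace(" ", "")


# Drop the three leading c residues before searching.
query = reverse_complement(strip_spaces(PRIMER_1)[3:])
fragment = strip_spaces(PDX14R_2941_3060)
match_index = fragment.find(query)

if __name__ == "__main__":
    print(query)
    print(match_index)
```

In this fragment the reverse-complemented primer is found, which suggests that the region amplified with this primer is printed in the opposite orientation in the pDX14R listing.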



<210> 8 
<211> 10181 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> plasmid - pDX17R
<400> 8

gaattttgcg gccgcttcga aagctgtaat ataaaaacct tcttcaacta acggggcagg 60
ttagtgacat tagaaaaccg actgtaaaaa gtacagtcgg cattatctca tattataaaa 120
gccagtcatt aggcctatct gacaattcct gaatagagtt cataaacaat cctgcatgat 180
aaccatcaca aacagaatga tgtacctgta aagatagcgg taaatatatt gaattacctt 240
tattaatgaa ttttcctgct gtaataatgg gtagaaggta attactatta ttattgatat 300
ttaagttaaa cccagtaaat gaagtccatg gaataataga aagagaaaaa gcattttcag 360
gtataggtgt tttgggaaac aatttccccg aaccattata tttctctaca tcagaaaggt 420
ataaatcata aaactctttg aagtcattct ttacaggagt ccaaatacca gagaatgttt 480
tagatacacc atcaaaaatt gtataaagtg gctctaactt atcccaataa cctaactctc 540
cgtcgctatt gtaaccagtt ctaaaagctg tatttgagtt tatcaccctt gtcactaaga 600
aaataaatgc agggtaaaat ttatatcctt cttgttttat gtttcggtat aaaacactaa 660 
tatcaatttc tgtggttata ctaaaagtcg tttgttggtt caaataatga ttaaatatct 720 
cttttctctt ccaattgtct aaatcaattt tattaaagtt catttgatat gcctcctaaa 780 
tttttatcta aagtgaattt aggaggctta cttgtctgct ttcttcatta gaatcaatcc 840 
ttttttaaaa gtcaatatta ctgtaacata aatatatatt ttaaaaatat cccactttat 900 
ccaattttcg tttgttgaac taatgggtgc tttagttgaa gaataaagac cacattaaaa 960 
aatgtggtct tttgtgtttt tttaaaggat ttgagcgtag cgaaaaatcc ttttctttct 1020 
tatcttgata ataagggtaa ctattgaatt cggtaccaag agtttgtaga aacgcaaaaa 1080 
ggccatccgt caggatggcc ttctgcttaa tttgatgcct ggcagtttat ggcgggcgtc 1140 
ctgcccgcca ccctccgggc cgttgcttcg caacgttcaa atccgctccc ggcggatttg 1200 
tcctactcag gagagcgttc accgacaaac aacagataaa acgaaaggcc cagtctttcg 1260 
actgagcctt tcgttttatt tgatgcctgg cagttcccta ctctcgcatg gggagacccc 1320 
acactaccat cggcgctacg gcgtttcact tctgagttcg gcatggggtc aggtgggacc 1380 
accgcgctac tgccgccagg caaattctgt tttatcagac cgcttctgcg ttctgattta 1440 
atctgtatca ggctgaaaat cttctctcat ccgccaaaac aggatcctca actgttttat 1500 
acaagtgcct tttgcttata ttcctcaacc atttcaacaa acagctgcgt cactcggtga 1560 
tcttctgtca gctccggatg gaatgagcag ccaaggaatt gcccctgttt cgcggctaca 1620 
atacgaccat tatgctccga tagaacttca acattttcac cagcttctaa aatatgcgga 1680 
gcacggatga ataccccagt aaaaggctcg tccaagcctt taattgttaa atcagcttca 1740 
aagctgtcaa cctgccggcc aaatgaatta cgttctacaa ccacattcag aagacctaaa 1800 
tgaggattat ctgaaccggc aatttctttt gctaatataa ttaatccggc acatgttcca 1860 
aacatcggtt tgccctgagc agcgaattca cgaagcggct ccatgaattg atacgtatcg 1920 
atcaaacggc gcatcgtcgt gctctcaccg cccggcaaaa tcaacccgtc aacttcgttc 1980 
agctgctccg gacgttttac gacaagacca gccgcgccgc atgcttcaat cgcatggatg 2040 
tgctctctaa ctgctccttg aagtcctagt acacctattg ttaacatgtc agcagcgctc 2100 
ctatgttctt accagccgcg ttcttgcata cgctgttctg gaagtaagtt tgagatttca 2160 
atccctttca ttgcagtacc aagctctttt gacaactcag cgattaattt gtaatcagta 2220 
aagtgagttg ttgcttccac aattgctttc gcaaatttag cagggttgtc tgatttaaaa 2280 
ataccagaac caacaaatac tccgtcagca ccaagctgca tcatgagagc agcatcagct 2340 
ggagttgcta cgccgccagc ggcaaagtta acgacaggaa gcttgccgtc ttttttaatt 2400
tgaagaagaa gctcgtaagg agcacctagg tttttcgctt ctgtcattag ctcatcctca 2460
ctcatcgcaa ctactttgcg cacttgagcg ttaactttac gcatatggcg aacagcctca 2520
acaatattac ctgttccagg ctcacctttt gtgcgaagca tagaagcacc ttccgcaata 2580 
cggcgtgttg cttcaccaag atcacggcag ccacagacaa aaggaactgt gtattcattt 2640 
ttatttaaat gaaattcttc gtcagccggc gtcagaactt cactttcatc aatatagtca 2700 
acacccatag cttcaagcac acgcgcttca acaatatgtc cgatacgcgc ttttgccatt 2760 
accgggatag atactgcatt cattacttct tccacgattg tagggtcagc catacgggca 2820
actcctccag ccgcgcgaat atctgctggc acacgttcta gcgccattac agcgacagct 2880
ccagcttctt cagcgatttt cgcttgttcc gcattgatga cgtccatgat gacgccgcct 2940
ttttgcattt ctgccattcc gcgttttaca cgttcagtac ctgtttgagc catgttttct 3000 
cctcctctag aacaggcggg gttgcccccg cctgtaatta aattattaca caccctgtag 3060 
ggaaagtcaa tacctttttg taaaattttt acacagcgtg gatctcttct agggacacct 3120 
ctttgtaccc ctcaagggag aaatattggc ggtactgagc acagttttgg ttggtggaca 3180 
gtgaaccata gctgtcgtca atagcctcga gttatggcag ttggttaaaa ggaaacaaaa 3240 
agaccgtttt cacacaaaac ggtctttttc gatttctttt tacagtcaca gccacttttg 3300 
caaaaaccgg acagcttcat gccttataac tgctgtttcg gtcgacgatg atctgccgtt 3360 
ttcttctgca agccaaaaaa ccttccgtta caacgagaag gattcttcac tttctaaagt 3420 
tcggcgagtt tcatccctct gtcccagtcc ttttttggat caaggcagac tgctgcaatg 3480 
tctatctatt ttaataatag gtgcagttcg caggcgatac tgcccaatgg aagtatacca 3540 
aaatcaacgg gcttgtacca acacattagc ccaattcgat atcggcagaa tagatttttt 3600 
taatgccttc gttcgtttct aaaagcagaa cgccttcatc atctatacct aacgccttac 3660 
cgtaaaaggt tccgtttaac gttctggctc tcatattagt gccaataccg agcgcatagc 3720 
tttcccataa aagcttaatc ggcgtaaatc cgtgcgtcat ataatcccgg taccgtttct 3780 
caaagcatag taaaatatgc tggatgacgc cggcccgatc aattttttcc ccagcagctt 3840
ggctgaggct tgtcgcgatg tccttcaatt catctggaaa atcattaggc tgctggttaa 3900 
cgttaatgcc gatcccaatg atcactgaac gtacgcggtc ttcttcagcc tgcatttccg 3960 
ttaggatacc gactgttttt tttccgttaa tcaaaatatc atttggccat ttaatatccg 4020 
tttggatgcc tgctgcctct tctattccct gcacaacagc tactgcagca agcagagtca 4080
gctgcggtgt tttttggagc ggaatgtcag gccgcaaaat caggctcatc caaacaccgt 4140
ttccttcttg agaatgccat accctagaca ttcggcccct tccggctgtt tgtttgtcag 4200 
ccaccacaag ggtgccttcc ggtgcgttat tattcgcgag ctcatgagcc gttttttgcg 4260 
tgcttgaaag aacgtcatgg taaataagat gctggcccat cacttccgtt tttaatccaa 4320 
aacgaatttc gctttcactg agttttccgg gttttttgat gagccgatat ccttttcttc 4380 
taacggcttc tacttcataa ccctctttcc gaagctcttc aatatgcttc cacacagcag 4440 
ttcttgaaca gccgagagca tcactgattt tttggccgga aataaattca ttgccggcct 4500 
gagaaaataa ttcaataagg tcttttctta atgttgaccg catgtcttca gccactcctc 4560 
tatgtgtttc ttttgattgg agagcttccc tgtcacaaca gcctgctcga tccactgtaa 4620 
ttcttctgac acccattttc cggccggccg gtttcgaagc gcaagcaagt ccttacccgt 4680 
gatatcaaga tccttaaggc ttttgatcgg caggttttga taagcgtact gaatgtcctt 4740 
cagtttcttt tcatccagtt tttcgttttg ccgaagctgc gatattttgg ccgctgagag 4800 
cagtgctttt ttcccagctc tgtacattgt cattgcgtca aggctctggc caaacgtatc 4860 
ggcaatgtga atggcttcct tgatcacttt tcccgggagc ttccaggctt tcaggaaaag 4920 
gggcgcgtct ttcaaaacta tgccaaggtt aattaaaaga gcagcccaaa gctcctcacg 4980 
ggatgttaaa gagaagaatg gaaactcact cgttgaaatc aggttttctc gtttatgata 5040 
aaaaccagga agctcttcat acaatctcgt ttgaatgagt gtttgaagcg cctggcgaga 5100
agctcttccc tgcagcaatt tctcaaactc tatagttttt cgttcgactg aaacatggga 5160 
gaggagtgat ttttctttcg caatggcttc ttctgtttcc ggtgaaagcg taaagccaag 5220 
ctggctcata aagcgtacgg ctctcagcat acgaagcgca tcctcttgaa atctatcctc 5280 
aggctttcca acggttcgaa tcactttctg atcaatatct ttcttgccgc caaaataatc 5340 
aagcaccttc ccgtccgctg tcatggccat cgcattgatc gttaaatctc tgcgttttag 5400
atcctcttct aatgatgaga taaattgcac ttctgacggt cttctgaaat caacataatc 5460 
agattcagtc cggaatgtcg tgacttcata ggtttcatcc tcccagagca caataatggt 5520 
cccgtgctct ttgcctacat caacagtccg ctgaaacagc cgttctactt gatcaggtgc 5580 
cgcatctgtc gcgatatcga catctccgat cgttcgtttc atatagctgt cacgaactgc 5640 
gcccccgaca aaataagcct gatggcccgc ttcgattaag atgcggagca cgggaagtgc 5700 
tttgataaaa actttttcca tgtgatcact ccggttctgc taaatcggca taaatctgtt 5760 
catactggct gacaattttt ttagaagaaa attcattttc aagcatctct attgccgcct 5820 
ttgtaaaacg attgcttagc tgttcatctt ctaaaatgct catcgcgcgg gctgttgcgg 5880 
ccgtaacatc accgacatcc accaaaaatc cgctcacatt gttttttata acctcaggga 5940 
taccgccaat gtttgttcca atacaaggca ctccgcaagc catcgcttca agcaggacaa 6000 
ggccaaagct ttctttttca gatagcagca gcttcaaatc gctaatagaa taaagatctt 6060 
caacacggtc ttgatttcca agcattaaga cttggtcttc caagccatat tttctgataa 6120 
gctcgcaggc tgtcgatttc tccggaccgt ctccgactaa aagcagcttc gctttcgttt 6180 
tgccagcgat attgcggaac acacggatga catcctgcac gcgtgcagcc actggtaaca 6240 
ggattagcag agcgaggtat gtaggcggtg ctacagagtt cttgaagtgg tggcctaact 6300 
acggctacac tagaaggaca gtatttggta tctgcgctct gctgaagcca gttaccttcg 6360 
gaaaaagagt tggtagctct tgatccggca aacaaaccac cgctggtagc ggtggttttt 6420 
ttgtttgcaa gcagcagatt acgcgcagaa aaaaaggatc tcaagaagat cctttgatct 6480 
tttctacggg gtctgacgct cagtggaacg aaaactcacg ttaagggatt ttggtcatga 6540 
gattatcaaa aaggatcttc acctagatcc ttttaaatta aaaatgaagt tttaaatcaa 6600 
tctaaagtat atatgagtaa acttggtctg acagttacca atgcttaatc agtgaggcac 6660 
ctatctcagc gatctgtcta tttcgttcat ccatagttgc ctgactcccc gtcgtgtaga 6720 
taactacgat acgggagggc ttaccatctg gccccagtgc tgcaatgata ccgcgagacc 6780 
cacgctcacc ggctccagat ttatcagcaa taaaccagcc agccggaagg gccgagcgca 6840 
gaagtggtcc tgcaacttta tccgcctcca tccagtctat taattgttgc cgggaagcta 6900 
gagtaagtag ttcgccagtt aatagtttgc gcaacgttgt tgccattgct acaggcatcg 6960 
tggtgtcacg ctcgtcgttt ggtatggctt cattcagctc cggttcccaa cgatcaaggc 7020 
gagttacatg atcccccatg ttgtgcaaaa aagcggttag ctccttcggt cctccgatcg 7080
ttgtcagaag taagttggcc gcagtgttat cactcatggt tatggcagca ctgcataatt 7140 
ctcttactgt catgccatcc gtaagatgct tttctgtgac tggtgagtac tcaaccaagt 7200 
cattctgaga atagtgtatg cggcgaccga gttgctcttg cccggcgtca atacgggata 7260 
ataccgcgcc acatagcaga actttaaaag tgctcatcat tggaaaacgt tcttcggggc 7320 
gaaaactctc aaggatctta ccgctgttga gatccagttc gatgtaaccc actcgtgcac 7380
ccaactgatc ttcagcatct tttactttca ccagcgtttc tgggtgagca aaaacaggaa 7440 
ggcaaaatgc cgcaaaaaag ggaataaggg cgacacggaa atgttgaata ctcatactct 7500
tcctttttca atattattga agcatttatc agggttattg tctcatgagc ggatacatat 7560
ttgaatgtat ttagaaaaat aaacaaatag gggttccgcg cacatttccc cgaaaagtgc 7620
cacctgacgt ctaagaaacc attattatca tgacattaac ctataaaaat aggcgtatca 7680
cgaggccctt tcgtctcgca tgcggatcag tgagggtttg caactgcggg tcaaggatct 7740
ggatttcgat cacggcacga tcatcgtgcg ggagggcaag ggctccaagg atcgggcctt 7800
gatgttaccc gagagcttgg cacccagcct gcgcgagcag gggaattgat ccggtggatg 7860
accttttgaa tgacctttaa tagattatat tactaattaa ttggggaccc tagaggtccc 7920
cttttttatt ttaaaaattt tttcacaaaa cggtttacaa gcataacggg ttttgctgcc 7980
cgcaaacggg ctgttctggt gttgctagtt tgttatcaga atcgcagatc cggcttcagg 8040
tttgccggct gaaagcgcta tttcttccag aattgccatg attttttccc cacgggaggc 8100
gtcactggct cccgtgttgt cggcagcttt gattcgataa gcagcatcgc ctgtttcagg 8160
ctgtctatgt gtgactgttg agctgtaaca agttgtctca ggtgttcaat ttcatgttct 8220
agttgctttg ttttactggt ttcacctgtt ctattaggtg ttacatgctg ttcatctgtt 8280
acattgtcga tctgttcatg gtgaacagct ttaaatgcac caaaaactcg taaaagctct 8340
gatgtatcta tcttttttac accgttttca tctgtgcata tggacagttt tccctttgat 8400
atctaacggt gaacagttgt tctacttttg tttgttagtc ttgatgcttc actgatagat 8460
acaagagcca taagaacctc agatccttcc gtatttagcc agtatgttct ctagtgtggt 8520
tcgttgtttt tgcgtgagcc atgagaacga accattgaga tcatgcttac tttgcatgtc 8580
actcaaaaat tttgcctcaa aactggtgag ctgaattttt gcagttaaag catcgtgtag 8640
tgtttttctt agtccgttac gtaggtagga atctgatgta atggttgttg gtattttgtc 8700
accattcatt tttatctggt tgttctcaag ttcggttacg agatccattt gtctatctag 8760
ttcaacttgg aaaatcaacg tatcagtcgg gcggcctcgc ttatcaacca ccaatttcat 8820
attgctgtaa gtgtttaaat ctttacttat tggtttcaaa acccattggt taagcctttt 8880
aaactcatgg tagttatttt caagcattaa catgaactta aattcatcaa ggctaatctc 8940
tatatttgcc ttgtgagttt tcttttgtgt tagttctttt aataaccact cataaatcct 9000
catagagtat ttgttttcaa aagacttaac atgttccaga ttatatttta tgaatttttt 9060
taactggaaa agataaggca atatctcttc actaaaaact aattctaatt tttcgcttga 9120
gaacttggca tagtttgtcc actggaaaat ctcaaagcct ttaaccaaag gattcctgat 9180
ttccacagtt ctcgtcatca gctctctggt tgctttagct aatacaccat aagcattttc 9240
cctactgatg ttcatcatct gagcgtattg gttataagtg aacgataccg tccgttcttt 9300
ccttgtaggg ttttcaatcg tggggttgag tagtgccaca cagcataaaa ttagcttggt 9360
ttcatgctcc gttaagtcat agcgactaat cgctagttca tttgctttga aaacaactaa 9420
ttcagacata catctcaatt ggtctaggtg attttaatca ctataccaat tgagatgggc 9480
tagtcaatga taattactag tccttttcct ttgagttgtg ggtatctgta aattctgcta 9540
gacctttgct ggaaaacttg taaattctgc tagaccctct gtaaattccg ctagaccttt 9600
gtgtgttttt tttgtttata ttcaagtggt tataatttat agaataaaga aagaataaaa 9660
aaagataaaa agaatagatc ccagccctgt gtataactca ctactttagt cagttccgca 9720
gtattacaaa aggatgtcgc aaacgctgtt tgctcctcta caaaacagac cttaaaaccc 9780
taaaggctta agtagcaccc tcgcaagctc gggcaaatcg ctgaatattc cttttgtctc 9840
cgaccatcag gcacctgagt cgctgtcttt ttcgtgacat tcagttcgct gcgctcacgg 9900
ctctggcagt gaatgggggt aaatggcact acaggcgcct tttatggatt catgcaagga 9960
aactacccat aatacaagaa aagcccgtca cgggcttctc agggcgtttt atggcgggtc 10020
tgctatgtgg tgctatctga ctttttgctg ttcagcagtt cctgccctct gattttccag 10080
tctgaccact tcggattatc ccgtgacagg tcattcagac tggctaatgc acccagtaag 10140
gcagcggtat catcaacagg cttacccgtc ttactgtcaa 10181



<210> 9 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> promoter sequence 
<400> 9 

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60
tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120
ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180
gcaaccccgc ctgt 194

<210> 10
<211> 163 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> promoter sequence 
<400> 10 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 
tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 
atctacaagg tgtggtataa taatcttaac aacagcagga cgc 163 

<210> 11 
<211> 127 
<212> DNA 

<213> Artificial Sequence 



<400> 11 

gaggaatcat agaattttgt caaaataatt ttattgacaa cgtcttatta acgttgatat 60 
aatttaaatt ttatttgaca aaaatgggct cgtgttgtac aataaatgta gtgaggtgga 120 
tgcaatg 127 

<210> 12 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 12 

gaaatcatat aactatacct tgattagggg gaccaagaaa tg 42 

<210> 13 
<211> 42 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 13 

caagaacgcg gctggtaaga acataggagc gctgctgaca tg 42 

<210> 14 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 



<220> 

<223> promoter sequence 



<400> 14 

tctagaaagg aggtga 16

<210> 15
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 15 

tctagaagga ggagaaaaca tg 22

<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 
<400> 16 

tctagaggag gagaaaacat g 21

<210> 17 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 



<400> 17 

taagaacaaa ggaggagagc tgacatg 27 

<210> 18 
<211> 27 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> ribosome binding site 
<400> 18 

taagaacaga ggaggagagc tgacatg 27

<210> 19 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ribosome binding site 



<400> 19 

taagaagaaa ggaggtgagc tgacatg 27



WO 2004/035010 PCT/US2003/008880



<210> 20 
<211> 1282 
<212> DNA 

<213> Bacillus subtilis 



<400> 20 

ttaatcaatt gaaagagctc tgatggatgt tagggctctt tcgcgtaata tactgaaaaa 60
acccaatttt taataaaaaa tgattttcaa agcgctaaaa accgaataaa atagtatata 120
tgcatatgaa ttattggatt tctaggatac aataaggatt agaaatcata taactatacc 180
ttgattaggg ggaccaagaa atggctcaaa caggtactga acgtgtaaaa cgcggaatgg 240
cagaaatgca aaaaggcggc gtcatcatgg acgtcatcaa tgcggaacaa gcgaaaatcg 300
ctgaagaagc tggagctgtc gctgtaatgg cgctagaacg tgtgccagca gatattcgcg 360
cggctggagg agttgcccgt atggctgacc ctacaatcgt ggaagaagta atgaatgcag 420
tatctatccc ggtaatggca aaagcgcgta tcggacatat tgttgaagcg cgtgtgcttg 480
aagctatggg tgttgactat attgatgaaa gtgaagttct gacgccggct gacgaagaat 540
ttcatttaaa taaaaatgaa tacacagttc cttttgtctg tggctgccgt gatcttggtg 600
aagcaacacg ccgtattgcg gaaggtgctt ctatgcttcg cacaaaaggt gagcctggaa 660
caggtaatat tgttgaggct gttcgccata tgcgtaaagt taacgctcaa gtgcgcaaag 720
tagttgcgat gagtgaggat gagctaatga cagaagcgaa aaacctaggt gctccttacg 780
agcttcttct tcaaattaaa aaagacggca agcttcctgt cgttaacttt gccgctggcg 840
gcgtagcaac tccagctgat gctgctctca tgatgcagct tggtgctgac ggagtatttg 900
ttggttctgg tatttttaaa tcagacaacc ctgctaaatt tgcgaaagca attgtggaag 960
caacaactca ctttactgat tacaaattaa tcgctgagtt gtcaaaagag cttggtactg 1020
caatgaaagg gattgaaatc tcaaacttac ttccagaaca gcgtatgcaa gaacgcggct 1080
ggtaagaaca taggagcgct gctgacatgt taacaatagg tgtactagga cttcaaggag 1140
cagttagaga gcacatccat gcgattgaag catgcggcgc ggctggtctt gtcgtaaaac 1200
gtccggagca gctgaacgaa gttgacgggt tgattttgcc gggcggtgag agcacgacga 1260
tgcgccgttt gatcgatacg ta 1282

<210> 21
<211> 293
<212> PRT

<213> Bacillus subtilis

<400> 21

Ala Gln Thr Gly Thr Glu Arg Val Lys Arg Gly Met Ala Glu Met Gln
1 5 10 15

Lys Gly Gly Val Ile Met Asp Val Ile Asn Ala Glu Gln Ala Lys Ile
20 25 30

Ala Glu Glu Ala Gly Ala Val Ala Val Met Ala Leu Glu Arg Val Pro
35 40 45

Ala Asp Ile Arg Ala Ala Gly Gly Val Ala Arg Met Ala Asp Pro Thr
50 55 60

Ile Val Glu Glu Val Met Asn Ala Val Ser Ile Pro Val Met Ala Lys
65 70 75 80

Ala Arg Ile Gly His Ile Val Glu Ala Arg Val Leu Glu Ala Met Gly
85 90 95

Val Asp Tyr Ile Asp Glu Ser Glu Val Leu Thr Pro Ala Asp Glu Glu
100 105 110

Phe His Leu Asn Lys Asn Glu Tyr Thr Val Pro Phe Val Cys Gly Cys
115 120 125

Arg Asp Leu Gly Glu Ala Thr Arg Arg Ile Ala Glu Gly Ala Ser Met
130 135 140

Leu Arg Thr Lys Gly Glu Pro Gly Thr Gly Asn Ile Val Glu Ala Val
145 150 155 160

Arg His Met Arg Lys Val Asn Ala Gln Val Arg Lys Val Val Ala Met
165 170 175

Ser Glu Asp Glu Leu Met Thr Glu Ala Lys Asn Leu Gly Ala Pro Tyr
180 185 190

Glu Leu Leu Leu Gln Ile Lys Lys Asp Gly Lys Leu Pro Val Val Asn
195 200 205

Phe Ala Ala Gly Gly Val Ala Thr Pro Ala Asp Ala Ala Leu Met Met
210 215 220

Gln Leu Gly Ala Asp Gly Val Phe Val Gly Ser Gly Ile Phe Lys Ser
225 230 235 240

Asp Asn Pro Ala Lys Phe Ala Lys Ala Ile Val Glu Ala Thr Thr His
245 250 255

Phe Thr Asp Tyr Lys Leu Ile Ala Glu Leu Ser Lys Glu Leu Gly Thr
260 265 270

Ala Met Lys Gly Ile Glu Ile Ser Asn Leu Leu Pro Glu Gln Arg Met
275 280 285

Gln Glu Arg Gly Trp
290

<210> 22 
<211> 988 
<212> DNA 
<213> Bacillus subtilis

<400> 22

ctggtatttt taaatcagac aaccctgcta aatttgcgaa agcaattgtg gaagcaacaa 60
ctcactttac tgattacaaa ttaatcgctg agttgtcaaa agagcttggt actgcaatga 120
aagggattga aatctcaaac ttacttccag aacagcgtat gcaagaacgc ggctggtaag 180
aacataggag cgctgctgac atgttaacaa taggtgtact aggacttcaa ggagcagtta 240
gagagcacat ccatgcgatt gaagcatgcg gcgcggctgg tcttgtcgta aaacgtccgg 300
agcagctgaa cgaagttgac gggttgattt tgccgggcgg tgagagcacg acgatgcgcc 360
gtttgatcga tacgtatcaa ttcatggagc cgcttcgtga attcgctgct cagggcaaac 420
cgatgtttgg aacatgtgcc ggattaatta tattagcaaa agaaattgcc ggttcagata 480
atcctcattt aggtcttctg aatgtggttg tagaacgtaa ttcatttggc cggcaggttg 540
acagctttga agctgattta acaattaaag gcttggacga gccttttact ggggtattca 600
tccgtgctcc gcatatttta gaagctggtg aaaatgttga agttctatcg gagcataatg 660
gtcgtattgt agccgcgaaa caggggcaat tccttggctg ctcattccat ccggagctga 720
cagaagatca ccgagtgacg cagctgtttg ttgaaatggt tgaggaatat aagcaaaagg 780
cacttgtata aaacagttga aagctgtgga aacttatagt acattataag cacaaataaa 840
gatcgaaaag cgttgatagg aactagtagg gagcctctct ttctaagaga gccgatggtt 900
ggtgcgaatc ggtgaaagat gctgtctgaa tccatccttg agcgaaatgc tgaagcgagt 960
aggcatttac gggataaccg ttatgttt 988

<210> 23
<211> 196
<212> PRT

<213> Bacillus subtilis

<400> 23








Met Leu Thr Ile Gly Val Leu Gly Leu Gln Gly Ala Val Arg Glu His
1 5 10 15

Ile His Ala Ile Glu Ala Cys Gly Ala Ala Gly Leu Val Val Lys Arg
20 25 30

Pro Glu Gln Leu Asn Glu Val Asp Gly Leu Ile Leu Pro Gly Gly Glu
35 40 45

Ser Thr Thr Met Arg Arg Leu Ile Asp Thr Tyr Gln Phe Met Glu Pro
50 55 60

Leu Arg Glu Phe Ala Ala Gln Gly Lys Pro Met Phe Gly Thr Cys Ala
65 70 75 80

Gly Leu Ile Ile Leu Ala Lys Glu Ile Ala Gly Ser Asp Asn Pro His
85 90 95

Leu Gly Leu Leu Asn Val Val Val Glu Arg Asn Ser Phe Gly Arg Gln
100 105 110

Val Asp Ser Phe Glu Ala Asp Leu Thr Ile Lys Gly Leu Asp Glu Pro
115 120 125

Phe Thr Gly Val Phe Ile Arg Ala Pro His Ile Leu Glu Ala Gly Glu
130 135 140

Asn Val Glu Val Leu Ser Glu His Asn Gly Arg Ile Val Ala Ala Lys
145 150 155 160

Gln Gly Gln Phe Leu Gly Cys Ser Phe His Pro Glu Leu Thr Glu Asp
165 170 175

His Arg Val Thr Gln Leu Phe Val Glu Met Val Glu Glu Tyr Lys Gln
180 185 190

Lys Ala Leu Val
195

<210> 24 
<211> 990 
<212> DNA 

<213> Escherichia coli 
<400> 24 

atggttaaaa cccaacgtgt tgtgatcact cccggcgagc ccgccgggat tggcccggac 60 
ttagttgtcc agcttgcaca gcgtgagtgg ccggtcgaac tggttgtttg tgccgatgcc 120 
actctcctta ccaaccgggc agcgatgctc ggtttgccgc tcaccctccg cccttattcc 180 
cccaactccc ctgcacaacc gcaaactgcg ggcacattaa cgctacttcc tgtcgcgcta 240 
cgtgcacctg tcactgcggg gcagttagcg gttgaaaatg ggcattatgt ggtggaaacg 300 
ctggcgcgag cgtgcgatgg ttgtctgaac ggcgaatttg ccgcgctgat cacaggtccg 360 
gtgcataaag gcgttattaa cgacgctggc attcctttta ccggtcatac cgagtttttc 420 
gaagagcgtt cgcaggcgaa aaaggtggtg atgatgctgg cgaccgaaga acttcgcgtg 480
gcgctggcaa cgacgcattt accgctgcgc gatatcgcag acgctatcac ccctgcactt 540 
ttgcacgaag tgattgctat tttgcatcac gatttgcgga ccaaatttgg tattgccgaa 600 
ccgcgcattc tggtctgcgg gctgaatccg cacgcgggcg aaggcggtca tatgggtacg 660 
gaagagatag acaccattat tccggtgctc aatgagctgc gggcgcaggg gatgaaactc 720 
aacgggccgc tgcctgccga taccctgttt cagccgaaat atcttgataa cgccgacgcc 780
gtgctggcga tgtaccacga tcagggtctt cccgtgctaa aataccaggg cttcgggcgc 840 
ggtgtgaaca ttacgctggg cctgcccttt attcgcacat cagtggacca cggcaccgcg 900 
cttgaactgg cgggacgtgg caaagccgat gtcggcagtt ttattacggc gcttaatctc 960 
gccatcaaaa tgattgttaa cacccaatga 990 

<210> 25 
<211> 329 
<212> PRT 

<213> Escherichia coli 
<400> 25 

Met Val Lys Thr Gln Arg Val Val Ile Thr Pro Gly Glu Pro Ala Gly
1 5 10 15

Ile Gly Pro Asp Leu Val Val Gln Leu Ala Gln Arg Glu Trp Pro Val
20 25 30

Glu Leu Val Val Cys Ala Asp Ala Thr Leu Leu Thr Asn Arg Ala Ala
35 40 45

Met Leu Gly Leu Pro Leu Thr Leu Arg Pro Tyr Ser Pro Asn Ser Pro
50 55 60

Ala Gln Pro Gln Thr Ala Gly Thr Leu Thr Leu Leu Pro Val Ala Leu
65 70 75 80

Arg Ala Pro Val Thr Ala Gly Gln Leu Ala Val Glu Asn Gly His Tyr
85 90 95

Val Val Glu Thr Leu Ala Arg Ala Cys Asp Gly Cys Leu Asn Gly Glu
100 105 110

Phe Ala Ala Leu Ile Thr Gly Pro Val His Lys Gly Val Ile Asn Asp
115 120 125

Ala Gly Ile Pro Phe Thr Gly His Thr Glu Phe Phe Glu Glu Arg Ser
130 135 140

Gln Ala Lys Lys Val Val Met Met Leu Ala Thr Glu Glu Leu Arg Val
145 150 155 160

Ala Leu Ala Thr Thr His Leu Pro Leu Arg Asp Ile Ala Asp Ala Ile
165 170 175

Thr Pro Ala Leu Leu His Glu Val Ile Ala Ile Leu His His Asp Leu
180 185 190

Arg Thr Lys Phe Gly Ile Ala Glu Pro Arg Ile Leu Val Cys Gly Leu
195 200 205

Asn Pro His Ala Gly Glu Gly Gly His Met Gly Thr Glu Glu Ile Asp
210 215 220

Thr Ile Ile Pro Val Leu Asn Glu Leu Arg Ala Gln Gly Met Lys Leu
225 230 235 240

Asn Gly Pro Leu Pro Ala Asp Thr Leu Phe Gln Pro Lys Tyr Leu Asp
245 250 255

Asn Ala Asp Ala Val Leu Ala Met Tyr His Asp Gln Gly Leu Pro Val
260 265 270

Leu Lys Tyr Gln Gly Phe Gly Arg Gly Val Asn Ile Thr Leu Gly Leu
275 280 285

Pro Phe Ile Arg Thr Ser Val Asp His Gly Thr Ala Leu Glu Leu Ala
290 295 300

Gly Arg Gly Lys Ala Asp Val Gly Ser Phe Ile Thr Ala Leu Asn Leu
305 310 315 320

Ala Ile Lys Met Ile Val Asn Thr Gln
325



<210> 26 
<211> 732 
<212> DNA 

<213> Escherichia coli 
<400> 26 

atggctgaat tactgttagg cgtcaacatt gaccatatcg ctacgctgcg caacgcgcgc 60 
ggtaccgctt acccggatcc ggtgcaggcc gcgtttattg ccgagcaggc gggagcggac 120 
ggcattaccg tgcatttacg tgaagatcgc cgtcacatta ctgaccgcga cgtgcgcatc 180 
ctgcgtcaga cgctggatac ccgcatgaat ctggagatgg cggtgaccga agagatgctg 240
gcgatcgccg ttgagacgaa gccacatttt tgctgcctgg taccggaaaa gcgtcaggaa 300 
gtaacaaccg aaggcggcct ggatgtcgca gggcagcgtg acaaaatgcg cgatgcctgc 360 
aaacgtctgg cagatgccgg gattcaggtt tctctgttta ttgacgccga tgaagagcag 420 
atcaaagctg cggcagaggt tggcgcaccg tttatcgaga tccacaccgg ttgctatgct 480
gatgccaaaa ctgacgccga acaggcgcaa gagctggcgc gtatcgccaa agccgcgacc 540 
tttgccgcaa gcctcggtct gaaagttaac gccggacacg gtctgaccta tcacaacgtg 600 
aaagccattg ccgccatccc tgagatgcat gaactgaata tcggtcatgc cattattggt 660 
cgtgcagtga tgaccggact gaaagatgcg gtggcagaaa tgaagcgtct gatgctggaa 720 
gcgcgtggct aa 732 

<210> 27 
<211> 243 
<212> PRT 

<213> Escherichia coli 



<400> 27 

Met Ala Glu Leu Leu Leu Gly Val Asn Ile Asp His Ile Ala Thr Leu
1 5 10 15

Arg Asn Ala Arg Gly Thr Ala Tyr Pro Asp Pro Val Gln Ala Ala Phe
20 25 30

Ile Ala Glu Gln Ala Gly Ala Asp Gly Ile Thr Val His Leu Arg Glu
35 40 45

Asp Arg Arg His Ile Thr Asp Arg Asp Val Arg Ile Leu Arg Gln Thr
50 55 60

Leu Asp Thr Arg Met Asn Leu Glu Met Ala Val Thr Glu Glu Met Leu
65 70 75 80

Ala Ile Ala Val Glu Thr Lys Pro His Phe Cys Cys Leu Val Pro Glu
85 90 95

Lys Arg Gln Glu Val Thr Thr Glu Gly Gly Leu Asp Val Ala Gly Gln
100 105 110

Arg Asp Lys Met Arg Asp Ala Cys Lys Arg Leu Ala Asp Ala Gly Ile
115 120 125

Gln Val Ser Leu Phe Ile Asp Ala Asp Glu Glu Gln Ile Lys Ala Ala
130 135 140

Ala Glu Val Gly Ala Pro Phe Ile Glu Ile His Thr Gly Cys Tyr Ala
145 150 155 160

Asp Ala Lys Thr Asp Ala Glu Gln Ala Gln Glu Leu Ala Arg Ile Ala
165 170 175

Lys Ala Ala Thr Phe Ala Ala Ser Leu Gly Leu Lys Val Asn Ala Gly
180 185 190

His Gly Leu Thr Tyr His Asn Val Lys Ala Ile Ala Ala Ile Pro Glu
195 200 205

Met His Glu Leu Asn Ile Gly His Ala Ile Ile Gly Arg Ala Val Met
210 215 220

Thr Gly Leu Lys Asp Ala Val Ala Glu Met Lys Arg Leu Met Leu Glu
225 230 235 240

Ala Arg Gly





















